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^ ' 4-31192-CIP 

SELECTABLE CELL SURFACE MARKER GENES 

This application claims the benefit under 35 USC § 119(e) of the following United States 
provisional patent applications: (1) Provisional Application No. 60/166,594, filed November 
5 19, 1999, for "Selectable Cell Surface Marker Genes;" (2) Provisional Application No. to be 
assigned, filed November 19, 1999, as Application No. 09/444,038 for "Selective Marker 
Genes," and subject to a Petition for Conversion to Provisional Application, filed November 16, 
2000; and (3) Provisional Application No. to be assigned, filed March 30, 2000 as Application 
No. 09/539,248 for "Selectable Marker Genes," and subject to a Petition for Conversion to 
10 Provisional Application, filed November 16, 2000. The disclosures of these three provisional 
applications are incorporated herein by reference in their entirety. 

This invention relates to a method of identifying genetically modified cells using a 
mutated protein-tyrosine kinase receptor (PTKR), particularly a mutated epidermal growth 
15 factor receptor (EGFR) family member or a mutated muscle specific kinase (MuSK) family 
member as a selectable cell marker. 

BACKGROUND OF THE INVENTION 

20 The use of selectable markers is well known for the identification of prokaryotic and 

eukaryotic cells, and the use is essential because frequently when a DNA sequence of interest is 
introduced into a cell it will not necessarily lead to a phenotype that is readily determined. The 
number of selectable markers used in identifying eukaryotic cells and especially mammalian 
cells has been limited, hi the past, selectable markers that conferred drug resistance were 

25 employed (i.e. G-418 and hygromycin). More recently, selectable markers that are combined 
with fluorescence activated cell sorting (FACS) have been used, for example, green fluorescent 
protein (GFP). Altematively, antibodies that recognize a cell surface molecule may be coupled 
to a fluorophore to help identify the cells of interest. 

30 Several cell surface molecules have been used as a selectable cell marker including murine 

CDS, CD24, and human Low-Affinity Nerve Growth Factor Receptor (NGFR). Reference is 
made to the following pubKcations; WO95/06723; WO98/19540; Jolly, et al., Proc. Natl. Acad. 
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Sci. 80: 477 (1983); and Reddy, et al., Mol. Brain Res. 8:137 (1990). Some of these cell surface 
molecules have been mutated in their intracellular domain to avoid signaling of the molecule 
when binding to their ligand. However upon Ugand binding some of the intracellularly mutated 
molecules may homo-, heterodimerize or trimerize. If a newly introduced molecule in the cell 
5 should heterodimerize with endogenous receptors a dominant negative effect may result. The 
present invention provides cell surface markers that do not heterodimerize with endogenous 
receptors. 

SUMMARY OF THE INVENTION 

10 

The present invention provides a method of identifying genetically modified mammalian 
cells comprising introducing a nucleic acid sequence encoding a mutated protein-tyrosine kinase 
receptor (PTKR) operatively linked to an expression control sequence into a cell to form a 
genetically modified cell; allowing expression of the mutated PTKR in the genetically modified 

15 cell; and identifying said genetically modified cell expressing the mutated PTKR. In a preferred 
embodiment the mutated PTKR is a mutated epidermal growth factor receptor (EGFR) family 
member (or MuSK family member), particularly preferred is a mutated EGFRl and more 
specifically the sequences given the designation EGFRl -I and EGFRl -U. In a second 
embodiment the introducing step is accomplished by incorporating the nucleic acid sequence 

20 encoding the mutated PTKR, and particularly the mutated EGFR into a vector and introducing 
the vector into a cell. The preferred vectors are retroviral vectors. In a fiirther embodiment the 
mammalian cells are human cells, particularly hematopoietic cells, hver cells, endothehal 
vascular cells, and smooth muscle cells, more particularly hematopoietic cells. In yet another 
embodiment a heterologous gene is incorporated into the construct or vector comprising the 

25 mutated PTKR marker sequence. In yet a fiirther embodiment the identifying step is 
accomplished by contacting the genetically modified cells with an antibody that recognizes and 
binds to the mutated PTK^. In yet another embodiment the identifying step separates the 
genetically modified cells from the non-genetically modified cells. 

30 la a second aspect, the invention provides a method of identifying genetically modified 

mammalian cells comprising the steps of incorporating into a vector a nucleic acid sequence 



i 

encoding a mutated epidermal growth factor receptor (EGFR) family member operatively linked 
to an expression control sequence; introducing the vector into a mammalian cell to form a 
genetically modified cell; allowing expression of the mutated EGFR in the genetically modified 
cell; and identifying said genetically modified cell expressing the mutated EGFR. In a preferred 
5 embodiment the mutated EGFR is EGFRl-I or EGFRl-H and the vector is a retroviral vector. 

hi a third aspect, the invention is directed to a method for the immunoselection of 
transduced mammalian cells comprising, retrovirally transducing mammalian cells with a 
nucleic acid sequence encoding a mutated epidermal growth factor receptor (EGFR) family 

10 member operatively linked to an expression control sequence; incubating the transduced cells 
with a marked antibody that recognizes and binds specifically to the mutated EGFR; and 
identifying the marked transduced cells, hi a preferred embodiment the cells are human cells, 
particularly hematopoietic cells, hi another embodiment the cells are transduced by a retroviral 
vector derived fi:-om the group consisting of moloney murine leukemia virus (MoMLV), 

15 myeloproliferative sarcoma virus (MPSV), murine embryonic stem cell virus (MESV), murine 
stem cell virus (MSCV) and spleen focus forming virus (SFFV). hi a further embodiment the 
cells are transduced with a vector derived from a lentivirus. hi yet a fiirther embodiment the 
method includes the step of separating the identified marked transduced cells fi-om non-marked 
cells. In yet another embodiment the method includes the step of expanding the marked 

20 transduced cells. 

In a fourth aspect, the invention is directed to a method of identifying mammahan cells 
expressing a protein of interest, comprising the steps of, introducing into a mammalian cell a 
nucleic acid sequence encoding a mutated PTKR operatively linked to an expression control 
25 sequence and a nucleic acid sequence encoding a protein of interest; culturing the resulting 
mammalian cells; and identifying cells which express the mutated PTKR thereby obtaining cells 
which express the protein of interest. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 A is a schematic representation of a wild-type (WT) EGFR molecule and two 
5 mutated EGFRs: EGFRl-I with amino acid residues deleted from the cytoplasmic domain and 
EGFRl-II with amino acid residues deleted from both the cytoplasmic and extracellular 
domains. Fig. IB illustrates the wild-type and the mutated MuSK. 

Figure 2 illustrates the amino acid sequence of a wild-type (WT) EGFRl and 
corresponds to SEQ ID NO: 2. The signal sequence is represented as amino acid residue 1 to 24. 
10 The extracellular domain includes amino acid residues 25 through 645, the transmembrane 
domain includes amino acid residues 646 through 668, and the cytoplasmic domain includes 
amino acid residues 669 through 1210. The tyrosine kinase domain is located at amino acid 
residues 718 though 964, and the threonine phosphorylation site is located at amino acid residue 
678. 

15 Figure 3 illustrates the nucleotide sequence of the wild-type (WT) EGFRl encoding the 

amino acid sequence of Figure 2 and corresponds to SEQ ID NO: 1. 

Figure 4 illustrates the general scheme used to generate a deletion in the extracellular 
domain and intracellular domain of the EGFR sequence depicted in Figure 2 and 3. 

Figure 5 illustrates expression of EGFRl -II on primary T cells and CD34"^ cells after 
20 transduction with supematants that were generated from PPA-6 cells. Panel A corresponds to 
untransduced cells, panel B illustrates expression after transduction and panel C illustrates 
expression after immuno-magnetic beadselection for mutated EGFR expression on T-cells. 
Panel D corresponds to untransduced cells and panel E illustrates expression of CD34^ cells. 

Figure 6 illustrates a MuSK-R designated hMuSK-R and corresponds to the nucleic acid 
25 sequence as set forth in SEQ ID N0:7 and the amino acid sequence as set forth in SEQ ID NO: 
8. 

DETAILED DESCRIPTION OF THE INVENTION 

30 The practice of the present invention will employ, unless otherwise indicated 

conventional techniques of cell biology, molecular biology, cell culture, immunology, virology. 
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and the like which are in the skill of one in the art. These techniques are fully disclosed in the 
current literature and reference is made specifically to Sambrook, Fritsch and Maniatis eds., 
"Molecular Cloning, A Laboratory Manual", 2"^ Ed., Cold Springs Harbor Laboratory Press, 
(1989); Celis, J.E. "Cell Biology, A Laboratory Handbook", Academic Press, Inc. (1994); 
5 CoUgan et al., "Current Protocols in Immunology", John Wiley and Sons (1991); and Harlow et 
al., "Antibodies: A Laboratory Manual" (1988), Biosupplynet Source Book (1999), Cold 
Springs Harbor Laboratory. 

All publications and patent appUcations cited in this specification are indicative of the 
level of skill of those skilled in the art to which this invention pertains, and are hereby 
1 0 incorporated by reference in their entirety. 

As used in this specification and the claims, the singular form "a", "an" and "the" 
include plural references unless the context clearly dictates otherwise. For example, a stem cell 
includes a pluraUty of stem cells. 

15 The selectable marker of the instant invention is a mutated protein-tyrosine kinase 

receptor (PTKR) molecule. PTKR molecules are activated by polypeptide hgands, and are 
closely related in their catalytic domains. They are Type I transmembrane proteins, with their N- 
termini outside the cell and single membrane-spanning regions. (See van der Geero et aL, Annu. 
Rev. Cell Biol. 10:251 (1994)). As used herein PTKRs include but are not limited to the 

20 following subfamilies, MuSK-R which is believed to initiate the formation of neuromuscular 
jimctions in response to agrin (Glass, et al Cell 85:513 (1996), epidermal growth factor receptor 
(EGFR), platelet-derived growth factor receptor (PDGFR), insulin receptor (INSR), nerve 
growth factor receptor (NGFR), fibroblast growth factor receptor (FGFR), and EPH. Each 
subfamily is comprised of various members. The PDGFR family includes PDGF receptor a, 

25 PDGF receptor p, SCF receptor (c-Kit), CSF-1 receptor (c-Fms), Flk2 (Flt3), FLTl (Quekl), 
Flkl (KDR), and FLT4 (Quk2). The INSF subfamily includes insulin receptor, IGF-1 receptor, 
IRR, ROS and Ltk. The NGFR subfamily includes NGFR receptor (TrkA), TrkB, and TrkC. 
The EGFR family includes EGFRl, EGFR2, EGFR3, and EGFR4. One skilled in the art is 
aware of alternative nomenclature for these members e.g. EGFRl has been referred to as EGFR, 

30 HER, c-ErbB, and ErbB-1; EGFR2 has been referred to as HER2, Neu and ErbB-2; EGFR3 has 
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been referred to as HERS and ErbB-3; and EGFR4 has been referred to as HER4 and ErbB-4. 
(UMch and Schlessinger, Cell, 61: 203-212 (1990)). 

The mutated PTKRs of the present invention include a modification in a PTKR molecule 
so that the receptor no longer possesses the signaling activity of the corresponding unmodified 
5 (wild type) receptor. For the purpose of the present disclosure, "Wild-Type PTKRs" include not 
only naturally occurring PTKRs but also may include genetically engineered PTKRs molecules 
wherein the PTKR molecule has undergone changes in the DNA sequence that do not 
significantly effect the properties of the protein tyrosine kinase receptor molecule. These 
changes include ones that do not change the encoded amino acid sequence, ones that result in 

10 conservative substitutions of amino acid sequences, or ones that result in one or a few amino 
acid deletions or additions. Suitable substitutions are known by those skilled in the art. Amino 
acid residues which can be conservatively substituted for one another include, but are not 
Umited to, glycine/alanine; valine/isoleucine/leucine, asparagine/glutamine; asparatic 
acid/glutamic acid; serine/threonine; lysine/arginine; and phenylalanine/tyrosine. 

15 A modification of the wild-type PTKR molecule may include deletions, truncations and 

the like. More specifically, mutated PTKRs of the invention include a modification, however, 
obtained, in the cytoplasmic domain causing the molecule to be devoid of signaling activity. 
SignaUng activity may be generally defined as activity that triggers a response pathway in the 
cytosol to the nucleus that ultimately leads to activation of transcription. The mutated PTKRs 

20 according to the invention may include a modification in the extracellular domain. However, 
preferably the extracellular domain should still be capable of binding an antibody. Li general, 
the smallest peptide firagment of the extracellular domain capable of binding an antibody would 
be approximately 15 amino acid residues, more preferably at least 50 amino acid residues. 

In a preferred embodiment, the selective marker PTKR is a mutated member of the 

25 EGFR family. The members of the EGFR family have similar overall topology and significant 
relatedness in amino acid sequence. Members of this family are not limited to the above- 
enumerated members, but include new members sharing common features, such as a single 
transmembrane domain; two cysteine rich regions of approximately 100 to 200 amino acid 
residues, and a single tyrosine kinase domain. Further, members of the EGFR family can 

30 heterodimerize with other members of the EGFR family. 
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A preferred mutated EGFR family member is EGFRl. EGFRl is a high affinity receptor 
for a number of ligands including but not limited to epidermal growth factor (EOF), 
transforming growth factor a (TGFa) and amphiregulin. Various EGFR sequences have been 
identified and reference is made to EMBL/GenBank Accession Numbers: X00588/X06370 
5 (Ullrich, et al, Nature 309:418 (1984)); K01885/K02047, (Lin et al, Science 224:843 (1984)); 
K03193 (Merlino, et ah, MoL Cell. Biol 5:1722 (1985)): X00663 (Xu, et al. Nature 309:806 
(1984); AC006977; M38425 (Simmen, et ah, Biochem. Biophys. Res. Commun. 124:125 
(1984)); and M20386 (Lax, et al, Mol. Cell. Biol. 8:1970 (1988)). Several EGFRs have been 
identified in invertebrate species, for example Drosophila having EMBL/GenBank Accession 

10 Number K03054 (Livneh, et al., Cell 40:599 (1985)). 

The domain structure of wild-type EGFR is schematically illustrated in Figure 1 . EGFR 
is comprised of a signal peptide that targets the protein to the secretory pathway. The 
extracellular domain follows the signal sequence. This domain is made up of several hundred 
amino acids. The extracellular domain means the part of the receptor that normally projects 

15 from the cell into the extracellular environment and in the EGFR it contains distinctive Cys 
residues. The transmembrane domain is generally locahzed in the cell membrane and consists of 
a stretch of hydrophobic residues followed by several basic residues. The cytoplasmic domain 
(also referred to as the intracellular domain) includes the catalytic part of the molecule and is 
positioned within the cell. This domain contains the substrate binding site and regulatory 

20 tyrosine and threonine phosphorylation sites. 

Modifications of EGFR family members are known, and particularly mutations of 
EGFRl are known. WO 93/05148 discloses three mutations of EGFRl. One mutation (referred 
to as HER721 A) is a point mutation in the cytoplasmic domain of EGFRl at amino acid position 
721. The lysine is changed to alanine. A second mutation (referred to as HERCD-533) involves 

25 the C-terminal end of the molecule wherein the intracellular domain was deleted but the 
transmembrane domain was not. A third mutation (referred to as HERCD-566) includes the C- 
terminal deletion of 566 amino acids, both the intracellular and transmembrane domains were 
deleted. Three truncated forms of EGFR have been detected in some maKgnancies: Type I 
(EGFRvIII) has a large deletion of the extracellular domain (amino acids 6 - 273) and does not 

30 bind EGF (Moscatello, D.K. et al., Oncogene 13:85 (1996)). Type II contains an in-frame 
deletion of 83 amino acids (520-603) in extracellular domain IV that does not prevent EGF and 
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TGFa binding (Humphrey, P.A. et al, Biophys. Res. Commun. 178:1413 (1991)). Type III has 
an in-frame deletion of 267 amino acids (29-296) in extracellular domains II and III. These two 
mutations prevent ligand binding (Humphrey, P.A. et al., Proc. Natl. Acad. Sci. USA 87:4207 
(1990)). 

5 While specific modifications such as, but not limited to those enumerated above may be 

known for the EGFR members, the use of a mutated EGFR family member as a selectable 
marker in mammalian cells is not known. According to the invention preferred modifications to 
the PTKR molecules and particularly to EGFR molecules include modifications to the 
cytoplasmic domain, such as deletions of at least 150, preferably at least 250, more preferably at 

10 least 400, and still more preferably at least 500 amino acids of the cytoplasmic domain. In a 
preferred embodiment, the deletions will consist of a truncation of the molecule. Truncations 
may include deletion of tyrosine phosphorylation sites in the range of 1 to 15 sites and deletion 
of the kinase catalytic site. However, a mutated EGFR usefixl as selectable marker in the 
invention may include more than 1 and fewer than 15 deletions of tyrosine phosphorylation 

15 sites. Additionally, the above modifications may include in-frame deletions, such as deletions of 
at least 50 amino acids of the cytoplasmic domain. These deletions may include deletion of the 
protein tyrosine-kinase activity. Particularly preferred selectable marker sequences are mutated 
EGFRl, EGFR2 and EGFR3 molecules wherein at least 400 and preferably 500 amino acids of 
the cytoplasmic domain are deleted from the corresponding wild-type molecule. 

20 In the present invention, a preferred mutated EGFR molecule is a mutated EGFRl. 

Particularly preferred is a modified sequence derived from EGFRl as illustrated in Figures 2 
and 3 (SEQ ID Nos. 1 and 2). In this EGFRl molecule, the extracellular domain is encoded by 
nucleotides 1 through 1935, the transmembrane domain is encoded by nucleotides 1938 through 
2004, and the intracellular domain is coded by nucleotides 2007 to 3630. One non-limiting 

25 example of a preferred mutated EGFRl marker is designated EGFRl -I wherein amino acid 
sequence 679 to 1210 of the cytoplasmic domain as illustrated in Figure 2 is deleted. 

Iri a second preferred embodiment, the selective marker PTKR is a mutated member of 
the MuSK-R family. MuSK-R is comprised of a signal sequence or leader sequence that targets 
the protein to the secretory pathway. The extracellular domain follows the signal sequence. This 

30 domain is made up of several hundred amino acids, and while the exact number of amino acid 
residues varies, typically the extracellular domain includes around 500 amino acids. The 
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extracellular domain is the part of the receptor that normally projects from the cell into the 
extracellular environment and includes a ligand binding region. The extracellular domain is one 
of the most distinctive features of the kinase receptors. Li MuSK-R, the extracellular domain 
contains immunoglobulin-like (Ig-Hke) regions. Typically four Ig-like regions are found. 
5 However there are reports of MuSK-Rs with three Ig-like regions. The extracellular domain may 
include 6 contiguous cysteine residues known as a C6-box. While the location of the C6-box 
may vary depending on the particular MuSK-R, in certain MuSK-Rs it is found approximately 
at amino acid residues 373 - 382. The transmembrane domain is generally localized in the cell 
membrane and consists of a stretch of hydrophobic residues followed by several basic residues. 

10 The intracellular domain (used interchangeably with the cytoplasmic domain) includes the 
catalytic part of the molecule and is positioned within the cell. 

MuSK-Rs are also known in the art as denervated muscle kinase receptors and have been 
referred to as DmKs (see U.S. Pat. No. 5,656,473 and particularly SEQ ID NOS: 16 and 17 
therein). MuSK-R sequences have been isolated and identified from humans, rats, mice, and 

15 xenopus. Closely related to himian MuSK-R is a receptor isolated from the electric ray Torpedo 
californica designated Torpedo tyrosine kinase receptor, and ROR tyrosine kinase receptors 
(Jennings, et al. Proc. Natl Acad, ScL USA 90:2895 (1993); Masiakowski et al., J. BioL Chem. 
267: 26181 - 26190 (1992); Valenzuela et al, Neuron, 15:573-584 (1995); and Hesser et al, 
FEES Letters, 442:133- 137 (1999)). 

20 Other non-limiting examples of MuSK-Rs available from public depositories such as 

GeneBank and ATCC include accession numbers: NM005592; AF006464; A448972; 
AI800924; AI700028; AI341265; AI341122; AI302067; U34985; AA448972; and ATCC 
75498. As mentioned above MuSK-R is specific to the skeletal muscle lineage. 

The term MuSK-R as used in the present specification and claims is broadly defined to 

25 include the known MuSK-Rs (including DmK receptors), isoforms or variants of known MuSK- 
Rs having similar structure, tyrosine kinase receptors that are fimctionally similar to known 
MuSK-Rs and novel MuSK-Rs not previously described that are identified using screening 
techniques well known to those in the art. Such techniques may include the use of degenerate 
oligodeoxyribonucleotide primers. 

30 Accordingly, the term MuSK-R when referring to a nucleic acid molecule includes (a) 

nucleic acid sequences comprising a coding region of a known mammalian MuSK-R; (b) a 
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nucleic acid sequence which hybridizes under stringent conditions to the nucleic acid of (a) and 
which encodes a mammalian MuSK-R; and (c) a degenerate MuSK-R wherein the MuSK-R has 
undergone changes in its nucleic acid sequence that does not significantly effect the properties 
of the MuSK-R protein encoded by the polynucleotide. These changes include ones that do not 

5 change the encoded amino acid sequence, ones that result in conservative substitutions of amino 
acid sequences, or ones that result in one or a few amino acid deletions or additions. Suitable 
substitutions are known by those skilled in the art. Amino acid residues that can be 
conservatively substituted for one another include but are not limited to, glycine/alanine; 
vahne/isoleucine/leucine, asparagine/glutamine; asparatic acid/glutamic acid; serine/threonine; 

10 lysine/arginine; and phenylalanine/tyrosine. Any conservative amino acid substitution not 
significantly affecting the properties of a MuSK-R is encompassed by the term. MuSK-R 
includes not only naturally occurring MuSK-Rs but also may include genetically engineered 
MuSK-Rs. 

The term MuSK-R when referring to a polypeptide encompasses known MuSK 
15 receptors, isoforms or variants of MuSK-Rs, and functionally equivalent receptors. A 
functionally equivalent receptor is a MuSK-R that can compete with a known MuSK-R for 
binding. More specifically, a functionally equivalent MuSK-R has at least 40%, preferably at 
least 60%, and more preferably at least 80% identical amino acids to the sequence set forth in 
SEQ ID NO: 8 and can compete with the MuSK-R illustrated in figure 6 for ligand or substrate 
20 binding. 

According to the invention mutated MuSK-R are used as selective markers to identify 
genetically modified cells. The marker is introduced on a nucleic acid construct into a target cell 
that normally does not express a MuSK-R. The term "introduced" is broadly used herein to 
include inserted, incorporated and the like. When the mutated MuSK-Rs are used as selectable 
25 markers the molecule no longer possesses signaling activity. Signaling activity has be generally 
defined as triggering a response pathway in the cytosol to the nucleus which ultimately leads to 
activation of transcription. The lack of signaling activity may be due to a) use of a MuSK-R in 
tissue or cells other than muscle (Glass et al. Cell 85:513-523 (1996)) or b) use of a mutated 
MuSK-R. 

30 While modifications of MuSK-R may be known, the method of identifying genetically 

modified cells comprising using a mutated MuSK-R as a selectable marker is not known. 
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As stated above, the localization of MuSK-R is in muscle tissue and MuSK-R serves as 
the functional agrin receptor. Agrin is a factor that can induce molecular reorganizations at the 
motor endplate. Therefore, MuSK-R may be used as a selective marker in tissue other than 
muscle. 

5 In a preferred embodiment, the modifications to a MuSK-R include truncations and/or 

deletions of MuSK-Rs. The mutation may occur in the extracellular domain and/or the 
intracellular domain by means well known in the art. The mutation causes the molecule to be 
devoid of signaling activity. Preferably the extracellular domain should still be capable of 
binding an antibody. In general the smallest peptide fragment of the extracellular domain 

10 capable of binding an antibody would be approximately 15 amino acid residues, more preferably 
at least 50 amino acid residues. 

A preferred MuSK-R according to the invention is the sequence set forth in SEQ ID 
NOs: 7 and 8, designated herein as hMuSK-R. The extracellular domain is encoded by 
nucleotides 1 through 1479, the transmembrane domain is encoded by nucleotides 1480 through 

15 1545, and the intracellular domain is encoded by nucleotides 1546 through 2607. Other 
preferred MuSK-Rs are molecules closely related to the sequences set forth in SEQ ID NO: 8 
and 7. Examples of closely related sequences are the sequences set forth in U.S. Pat No. 
5,656,473 particularly NO: 16 and 17 disclosed therein. 

Mutants of MuSK-R are known and reference is made to Apel et al.. Neuron 18:623 - 

20 635 (1997). In the present invention, preferred modifications to a MuSK-R include 
modifications to the cytoplasmic domain such as deletions of at least 150, preferably at least 
200, more preferably at least 250, more preferably 300, and most preferably at least 350 amino 
acids of the cytoplasmic domain. The deletions are preferably truncations. Deletions or 
truncations may include deletion of tyrosine phosphorylation sites in the range of 1 to 19, 

25 preferably 2-15, more preferably 2-10 sites. Additionally the kinase catalytic site may be 
deleted from a MuSK-R. In one aspect, the kinase catalytic site is found at approximately amino 
acid residues 672 to 691 of SEQ ID NO: 8. As long as the protein is stably expressed, there is no 
limitation to the number of sequences deleted or truncated in the cytoplasmic domain. 

Particularly preferred mutated MuSK-Rs usefiil as selectable markers according to the 

30 invention include modifications to the MuSK-R sequence set forth in Figure 6 (SEQ ID NO: 8). 
In one embodiment the MuSK-R is truncated by least 300 amino acid residues in the 
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cytoplasmic domain. One preferred embodiment includes the deletion of amino acid sequence 
538 - 869 and is designated mMuSK-RI. Another preferred embodiment includes the deletion of 
amino acid sequence 577 - 869 and is designated mMuSK-RIL 

In addition to modification of the C)1;oplasmic domain mutations may be made in the 
5 extracellular domain. The extracellular domain modification may include deletion of at least 
about 100 amino acids, preferably at least about 150 amino acids, more preferably at least about 
200 amino acids, and still more preferably at least about 250 amino acids. A mutated MuSK-R 
used as a selectable marker according to the invention preferably should contain an antibody- 
binding site in the extracellular domain. 

10 In another preferred embodiment, a mutated PTKR, particularly a mutated EGFR or 

mutated MusSK-R useful as a selectable marker includes both modification of the cytoplasmic 
domain such as described above, and modification to the extracellular domain. This is expected 
toto avoid heterodimerization of the mutated PTKR with endogenous receptors. 

The extracellular domain modifications may include deletion of at least about 100 amino 

15 acids, preferably at least about 150 amino acids, more preferably at least about 200 amino acids, 
and still more preferably at least about 250 amino acids. In one embodiment, the modification 
will include removal of one cysteine-rich area generally characterized by an area of about 100 
amino acids. Particularly preferred marker sequences are mutated EGFRl, EGFR2 and EGFR3 
molecules having not only a truncation in the cytoplasmic domain but also a deletion of at least 

20 200 and preferably at least 250 amino acids of the extracellular domain. As mentioned above, a 
mutated PTKR as a selectable marker useful in the methods of the invention should preferably 
include an antibody-binding site in the extracellular domain. 

More particularly, a preferred mutated EGFR molecule wherein both the extracellular 
domain and the cytoplasmic domain are modified is a sequence derived firom WT EGFRl 

25 illustrated in Figure 2 (SEQ ID Nos. 1 and 2). Particularly preferred is a mutated EGFRl 
sequence designated EGFRl-II wherein amino acid sequence 25 through 313 of the extracellular 
domain and amino acid sequence 679 through 1210 of the cytoplasmic domain of Figure 2 (SEQ 
ID NO. 2) are deleted. 

As used herein a "mutated PTKR" or a "mutated EGFR" or a "mutated MuSK-R" 
30 selectable marker refers to nucleotides or protein as appropriate fi-om context. Polynucleotides 
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of the invention may be in the form of RNA or in the form of DNA, which DNA includes 
cDNA, genomic DNA or synthetic DNA. 

General strategies for creating mutations in nucleic acids and proteins are well known. 
These methods may be used to create mutant PTKRs, particularly mutant EGFR family 

5 members useful as selectable markers in the present invention. One skilled in the art is aware 
that PTKR molecules including members of the EGFR family may be obtained from sources 
such as various sequence databases including GenBank. Both random and site-directed 
mutagenesis methods may be effective to create mutations in wild type PTKRs. Random 
methods encompass altering the sequences within restriction endonuclease sites, inserting an 

10 oHgonucleotide linker into a plasmid, using chemicals to damage plasmid DNA, and 
incorporating incorrect nucleotides during in vitro DNA synthesis. However, site-directed 
mutagenesis may be a more beneficial tool. Particularly preferred site-directed methods include 
oligonucleotide-directed mutagenesis and polymerase chain reaction (PGR) -amplified 
ohgonucleotide-directed mutagenesis. These methods are also well known in the art and 

15 reference is made to Wu, et al., eds. Methods in Enzymology, Vol. 154: Recombinant DNA, 
Part E, Academic, NY (1987); Landt, et al., Gene, 96:125 - 128 (1990); Kirchhoff et ak. 
Methods MoL Biol 57:323-333 (1995); Herlitze, et al, Gene 91:143-147 (1990); Sambrook, et 
al. Molecular Cloning, A Laboratory Manual, supra ^ and Current Protocols in Molecular 
Biology, Greene PubUshing Associates, John Wiley & Sons, Lie Canada (1988). 

20 The usefulness of a mutated PTKR as a selectable marker concems the ability to identify 

genetically modified manmialian cells in vitro, ex vivo, and in vivo. While the mutated PTKR 
sequence may be introduced into a target cell as part of a nucleic acid construct operatively 
linked to an expression control sequence, in a preferred embodiment the construct including the 
mutated PTKR sequence is placed in a vector and then introduced into a target cell. As used 

25 herein "operatively linked" refers to an arrangement of elements wherein the components are 
configured so as to perform their usual function. The control elements need not be contiguous 
with the coding sequence. 

Vectors containing both a promoter and a cloning site into which a polynucleotide can be 
operatively linked are well known in the art. Such vectors are capable of transcribing RNA in 

30 vitro or in vivo, and are commercially available from sources such as Stratagene (La JoUa, CA) 
and Promega Biotech (Madison, WI). Examples of vectors include vectors derived from viruses, 
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such as baculoviras, retrovirases, adenovirases, adeno-associated viruses, and herpes simplex 
viruses; bacteriophages; cosmids; plasmid vectors; fungal vectors; synthetic vectors; and other 
recombination vehicles typically used in the art. These vectors have been described for 
expression in a variety of eukaryotic and prokaryotic hosts and may be used for protein 
5 expression. 

Li a preferred embodiment, the vector comprises a nucleic acid sequence coding for a 
selective marker according to the invention, operatively linked to an expression control 
sequence. Selection of appropriate control sequences is dependent on the target cell used and the 
choice is within the skill of one in the art. Examples of expression control sequences, also 

10 referred to as regulatory sequences, include promoters, enhancers, polyadenylation signals, 
RNA polymerase binding sequences, sequences conferring inducibility of transcription and 
other expression control elements, such as scaffold attachment regions (SARs). 

The promoter may be either a prokaryotic or eukaryotic promoter. Additionally the 
promoter may be a tissue specific promoter, inducible promoter, synthetic promoter, or hybrid 

15 promoter. More than one promoter may be placed in the construct. Examples of promoters 
include but are not Kmited to the phage lamda (PL) promoter; S V40 early promoter; adenovirus 
promoters, such as adenovirus major late promoter (Ad MLP); herpes simplex virus (HSV) 
promoter; a cytomegalovirus (CMV) promoter; such as the human CMV immediate early 
promoter; a long terminal repeat (LTR) promoter, such as a MoMLV LTR; the U3 region 

20 promoter of the Moloney murine sarcoma virus; Granzyme A promoter; regulatory sequences of 
the metallothioein gene; CD34 promoter; CDS promoter; thymidine kinase (TK) promoters; B19 
parvovirus promoters; and rous sarcoma virus (RSV) promoter. Additionally promoter elements 
from yeast and other fungi may be used, such as Gal 4 promoter and the alcohol dehydrogenase 
(ADH) promoter. These promoters are available commercially from various sources such as 

25 Stratagene (La JoUa, CA). It is to be understood that the scope of the present invention is not to 
be limited to a specific promoter. 

The vector may further comprise a polyadenylation signal that is positioned 3' of the 
carboxy-terminal amino acid. Vectors containing both a promoter and a cloning site into which 
a polynucleotide can be operably linked are well known in the art. Such vectors are capable of 

30 transcribing RNA in vitro or in vivo^ and are commercially available. Specific non-limiting 
examples include pSG, pSV2CAT, and pXtl from Stratagene (La Jolla, CA) and pMSG, pSVL, 
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pBPV and pSVK3 from Pharamacia. Other exemplary vectors include the pCMV mammalian 
expression vectors, such as pCMV6b and pCMV6c (Chiron Corporation, CA), pSFFV-Neo, and 
pBluescript-SK+. In order to optimize expression and/or in vitro transcription, it may be 
necessary to remove, add or alter 5'and/or3' untranslated portions of polynucleotides to 
5 eliminate potentially extra inappropriate altemative translation initiation codons or other 
sequences that may interfere with or reduce expression, either at the level of transcription or 
translation. Alternatively consensus ribosome binding sites can be inserted immediately '5' of 
the start codon to enhance expression. 

Particularly preferred vectors are retroviral vectors and reference is made to Coffin et al., 

10 "Retroviruses", (1997) Chapter 9 pp; 437-473 Cold Springs Harbor Laboratory Press. Retroviral 
vectors useful in the invention are produced recombinantly by procedures already taught in the 
art. W094/29438, W097/21824 and W097/21825 describe the construction of retroviral 
packaging plasmids and packing cell lines. Common retroviral vectors are those derived from 
murine, avian or primate retroviruses. The most common retroviral vectors are those based on 

15 the Moloney murine leukemia virus (MoMLV) and mouse stem cell vims (MSCV). Vectors 
derived firom MoMLV include, LMily, LINGFER, MINGFR, MND and MINT (Bender, et al, 
J. Virol. 61:1639 -1649 (1987); Miller, et al., Biotechniques 7: 98-990 (1989); Robbins, et al., J. 
Virol. 71 :9466-9474 (1997) and U. S. Pat. No. 5,707,865). Vectors derived from MSCV include 
MSCV-MiLy (Agarwal, et al., J. of Virology 72:3720). Further non-limiting examples of 

20 vectors include those based on Gibbon ape leukemia virus (GALV), Moloney murine sacroma 
virus (MoMSV), myeloproliferative sarcoma virus (MPSV), murine embryonic stem cell virus 
(MESV), spleen focus forming virus (SFFV) and the lentiviruses, such as Human 
immunodeficiency virus (HIV-l and HIV-2). New vector systems are continually being 
developed to take advantage of particular properties of parent retroviruses such as host range, 

25 usage of altemative cell surface receptors and the like (See C. Baum et al.. Chapter 4 in Gene 
Therapy of Cancer Cells eds., Lattime and Gerson (1998)). The present invention is not Hmited 
to particular retroviral vectors, but may include any retroviral vector. Particularly preferred 
vectors include DNA from a murine virus corresponding to two long terminal repeats, and a 
packaging signal. In one embodiment the vector is a MoMLV or MSCV derived vector. In 

30 another preferred embodiment the vector is MND. 
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In producing retroviral vector constructs, the viral gag, pol and env sequence will 
generally be removed from the virus, creating room for insertion of foreign DNA sequences. 
Genes encoded by foreign DNA are usually expressed under the control a strong viral promoter 
in the long terminal repeat (LTR). While a LTR promoter is preferred, as mentioned above, 
5 numerous promoters are known. 

Such a construct can be packaged into viral particles efficiently if the gag, pol and env 
functions are provided in trans by a packaging cell line. Therefore when the vector construct is 
introduced into the packaging cell, the gag-pol and env proteins produced by the cell, assemble 
with the vector RNA to produce infectious virions that are secreted into the culture medium. The 

10 virus thus produced can infect and integrate into the DNA of the target cell, but does not 
produce infectious viral particles since it is lacking essential packaging sequences. Most of the 
packaging cell hues currently in use have been transfected with separate plasmids, each 
containing one of the necessary coding sequences, so that multiple recombination events are 
necessary before a replication competent virus can be produced. Altematively the packaging cell 

15 line harbors a provirus. (The DNA form of the reverse-transcribed RNA once its integrates into 
the genomic DNA of the infected cell). The provirus has been crippled so that although it may 
produce all the proteins required to assemble infectious viruses, its own RNA can not be 
packaged into virus. RNA produced from the recombinant virus is packaged instead. Therefore, 
the virus stock released from the packaging cells contains only recombinant virus. Non-limiting 

20 examples of retroviral packaging lines include PA12, PA317, PE501, PG13, TCRIP, RD114, 
GP7C-tTA-G10, ProPak-A (PPA-6), and PT67. Reference is made to Miller et al., Mol. Cell 
BioL 6:2895 (1986); Miller, et al., Biotechniques 7:980 (1989); Danos, et al., Proc. Natl. Acad. 
Sci. USA 85:6460 (1988); Pear, et al., Proc. Natl. Acad. Sci. USA 90:8392 (1993); Rigg, et al., 
Virology 218: (1996); and Finer et al.. Blood 83:43 (1994). Retroviral vector DNA can be 

25 introduced into packaging cells either by stable or transient transfection to produce vector 
particles. 

Additionally preferred vectors include adenoviral vectors (See Frey et al. Blood 
91:2781 (1998) and WO95/27071) and adeno-associated viral vectors (AAV) (See Chatterjee et 
al., Current Topics in Microbiol, and Immunol. 218:61 (1996). Reference is also made to 
30 Shenk, Chapter 6, 161- 78, Breakefield et al., Chapter 8 201-235; Kroner-Lux et al, Chapter 9, 
235-256 in Stem Cell Biology and Gene Therapy, eds. Quesenberry et al., John Wiley & Sons, 
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1998 and U.S. Pat. Nos. 5,693,531 and 5,691,176. The use of adenovirus derived vectors may be 
advantageous under certain situations because they are capable of infecting non-dividing cells, 
and unlike retroviral DNA, the adenoviral DNA is not integrated into the genome of the target 
cell. Further the capacity to carry foreign DNA is much larger in adenoviral vectors than 

5 retroviral vectors. The adeno-associated viral vectors are another useful delivery system. The 
DNA of these viruses may be integrated into non-dividing cells, and a number of 
polynucleotides have been successfully introduced into different cell types using adeno- 
associated viral vectors. The vectors are capable of transducing several cell types including 
hematopoietic cells and epithelial cells. 

10 Vectors may also include hybrid vectors of AAV and adenoviruses as described in 

W096/13598 and W099/47691 (The Trustees of the University of Pennsylvania), W098/21345 
(General Hospital), US5965441 (General Hospital), or WO99/58700 (Ariad Gne Therap.), the 
teaching of which being incorporated into the present invention in their entirety. 

In one embodiment, the construct or vector will include not only a nucleic acid sequence 

15 encoding a mutated PTKR or a mutated EGFR as a selectable marker but also a second nucleic 
acid sequence encoding a heterologous gene to be transferred into a target cell. In a preferred 
embodiment the nucleic acid molecules are DNA. The term "heterologous" as it relates to 
nucleic acid sequences such as gene sequences denotes sequences that are not normally 
associated with a particular cell or vector and which are suitably inserted into a construct or 

20 vector under control of a promoter to permit expression in the target cell to be genetically 
modified. The heterologous gene may be located 5' or 3' to the selectable marker of the 
invention. 

Non-limiting preferred vector constructs comprise the general structure as outlined 
below 5' to 3': 
25 (a) LTR-X-I-M-LTR; 

(b) LTR-M-LTR; 

(c) LTR-M-(I)-X-LTR; 

(d) LTR-X-pM-LTR; and 

(e) LTR-X-I-M-SAR-LTR 
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wherein LTR is a long terminal repeat, X is a heterologous gene for a desired protein, M 
is a selectable marker of the invention, I is an internal ribosomal entry site, SAR is a scaffold 
attachment region and p is a second promoter, preferably a CMV or PGK promoter. 

A gene or coding sequence or a sequence that encodes a particular protein is a nucleic 
5 acid molecule that is transcribed and translated into a polypeptide in vitro or in vivo when placed 
under the control of appropriate regulatory sequences. The heterologous gene may be any gene 
for which expression is desired. Examples of heterologous genes include but are not limited to a 
therapeutic protein, a structural gene, a ribozyme, or an antisense sequence. The heterologous 
gene may be the entire protein or only the functionally active fragment thereof The protein may 
10 include for example one that regulates cell differentiation or a therapeutic gene capable of 
compensating for a deficiency in a patient that arises from a defective endogenous gene. 
Additionally a therapeutic protein or gene may be one that antagonizes production or function of 
an infectious agent, antagonizes pathological processes, improves a host's genetic makeup, or 
facilitates engrafltment. 

15 Specific examples of a therapeutic gene or gene sequences are ones effective in the 

treatment of adenosine deaminase deficiency (ADA); sickle cell anemia; recombinase 
deficiency; recombinase regulatory gene deficiency; HIV, such as an antisense or trans- 
dominant REV gene; or a gene carrying a herpes simplex virus thymidine kinase (HSV-tk)). 
The heterologous gene may encode new antigens or drug resistant genes or may encode a toxin 

20 or an apoptosis inducer effective to specifically kill cancerous cells, or a specific suicide gene to 
hematopoietic cells may be included. Additionally the heterologous gene may be a therapeutic 
gene that is non-human such as a yeast gene (Seo et al., Proc. Natl. Acad. Sci. 95:9167 (1998)). 

The vector or construct may also comprise a second heterologous gene in addition to the 
first heterologous gene encoding a protein of interest. More than one gene may be necessary for 

25 the treatment of a particular disease. Altematively more than one gene can be delivered using 
several compatible vectors. Depending on the genetic defect, the therapeutic gene can include 
regulatory and untranslated sequences. For human patients the therapeutic gene will generally 
be of human origin although genes of closely related species that exhibit high homology and 
biologically identical or equivalent function in hxmians may be used if the gene does not 

30 produce an adverse immune reaction in the recipient. 
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Nucleotide sequences of the heterologous gene encoding a protein of interest will 
generally be known in the art or can be obtained from various sequence databases such as 
GenBank. One skilled in the art will readily recognize that any structural gene can be excised as 
a compatible restriction fragment and placed in a vector in such a manner as to allow proper 

5 expression of the structural gene in target cells. 

The target cells of the invention are mammahan cells and these include but are not 
hmited to humans, mice, monkeys, chimpanzees, farm animals; such as cattle, sheep, pigs, 
goats, and horses, sport animals, pets; such as dogs and cats, and other laboratory rodents and 
animals; such as mice, rats, guinea pigs and the like. Preferably the target cells are human cells. 

10 Preferred human cells include liver, hematopoietic, smooth muscle, neural, endothelial vascular 
cells, tumor cells and epithelial cells. Hematopoietic cells are particularly preferred, and these 
cells encompass hematopoietic stem cells, erythrocytes, neutrophils, monocytes, platelets, mast 
cells, eosinophils and basophils, B and T lymphocytes, and NK cells as well as the respective 
lineage progenitor cells. Hematopoietic stem cells and T-cells are especially preferred. 

15 Hematopoietic stem cells (HSC) are defined as a population of hematopoietic cells containing 
long term multilineage repopulating potential. T-cells are defined as a type of lymphocyte and 
are thought to develop from hematopoietic stem cells. There are many types of T-cells including 
cytotoxic T-cells, helper T-cells, inducer T-cells and supressor T cells. 

Methods of obtaining target cells, particularly hematopoietic cells are well known in the 

20 art and not repeated herein. Non-limiting sources of hematopoietic cells, including 
hematopoietic stem cells, are bone marrow, embryonic yolk sac, fetal hver tissue, adult spleen, 
and blood such as adult peripheral blood and umbiUcal cord blood. (To et al., Blood 89:2233 
(1997)). Bone marrow cells may be obtained from ilium, stemum, tibiae, femora, spine and 
other bone cavities. 

25 The manner in which target cells may be separated from other cells is not critical to this 

invention. Various procedures may be employed and include physical separation, magnetic 
separation using antibody-coated magnetic beads, affinity chromatography, and cytotoxic agents 
joined to a monoclonal antibody or used in conjunction with a monoclonal antibody. Also 
included is the use of fluorescence activated cell sorters (FACS) wherein the cells can be 

30 separated on the basis of the level of staining of the particular antigens. These techniques are 
well known to those skilled in the art and are described in various references including U.S. 
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Patent Nos. 5,061,620; 5,409,8213; 5,677,136; and 5,750,397; and Yau et al, Exp. HematoL 
18:219-222(1990). 

The order of cell separation is not critical to the invention, and specific cell types may be 
separated either prior to genetic modification with the mutated PTKR or after genetic 

5 modification. Preferably cells are initially separated by a coarse separation followed by using 
positive and/or negative selection. In humans the surface antigen expression profile of an 
enriched hematopoietic stem cell population may be identified by CD34^Thy-l"^Lin". Other 
nonlimiting enriched phenotypes may include: CD2", CD3", CD4", CD8", CDIO', CD14", CD15", 
CD19", CD20", CD33", CD34", CD38^°^ CD45, CD59''^', CD7r, CDW109^, glycophorin , 

10 AC133"^, HLA-DR"^^', and ElVT . Lin" refers to a cell population selected on the basis of lack of 
expression of at least one lineage specific marker, such as, CD2, CD3, CD 14, CD 15 and CD56. 
The combination of expression markers used to isolate and define an enriched HSC population 
may vary depending on various factors and may vary as other express markers become 
available. 

15 Murine HSCs with similar properties to the human CD34"^Thy-l'^Lin" may be identified 

by kit^Thy-l.l^^Lin"^°Sca-l^ (KTLS). Other phenotypes are well known. When CD34 
expression is combined with selection for Thy-1, a composition comprising approximately 
fewer than 5% lineage committed cells can be isolated (U.S. Patent No. 5,061,620). 

It has been shown that CD3 is expressed on most T cells, and that these cells can express 

20 the cell surface antigens CD2, CD4, and CD8 antigens. Also CD45 is a useful T cell marker. 
The most well known T cell marker is the T cell antigen receptor (TCR). There are presently 
two defined types of TCRs; a, p- TCR and y, 6- TCR. B cells may be selected, for example, by 
expression of CD 19 and CD20. Myeloid cells may be selected for example, by expression of 
CD14, CD15 and CD16. NK cells may be selected based on expression of CD56 and CD16. 

25 Erythrocytes may be identified by expression of glycophorin A. Neuronal cells may be 
identified by NCAM and LNGFR (Baldwin et al., J. Cell Biochem. 15:502 (1996)). Vascular 
endothehal cells may be identified by VEGFR2, CD34, P-Selectin, VCAM-1, ELAM-1, and 
ICAM-l.(Horvathova et al., Biol. Trace Elem. Res., 69: 15-26 (1999). One skilled in the art is 
aware of other useful markers for the identification of other target cells. 

30 Once a population containing the target cells are harvested and target cells, particularly 

hematopoietic cells, are separated, the cells are cultured in a suitable medium comprising a 
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combination of growth factors that are sufficient to maintain growth. Methods for culturing 
target cells are well known to those skilled in the art, and these methods are only briefly 
mentioned herein. Any suitable culture container may be used^ and these are readily available 
from commercial vendors. The seeding level is not critical and will depend on the type of cells 

5 used, but in general the seeding level for hematopoietic cells will be at least 10 cells per ml, 
more usually at least about 100 cells per ml and generally not more than 10^ cells per ml when 
the cells express CD34. 

Various culture media, solid or liquid, can be used and non-limiting examples include 
DMEM, IMDM, X-vivo 15 and RPMI-1640. These are commercially available from various 

10 vendors. The formulations may be supplemented with a variety of different nutrients, growth 
factors, such as cytokines and the like. The medium can be serum free or supplemented with 
suitable amounts of serum such as fetal calf serum, autologous serum or plasma. If cells or 
cellular products are to be used in humans, the medium will preferably be serum free or 
supplemented with autologous serum or plasma. (Lansdorp et al., J. Exp, Med. 175:1501 (1992) 

15 and Petzer, et al. PNAS 93:1470 (1996). Also reference is made to Freshney, R. L, "Culture of 
Animal Cells, A Manual of Basic Techniques", Wiley-Liss, Inc. (1994)). 

Non-limiting examples of compounds which may be used to supplement the culture 
medium are TPO, FL, KL, IL-1, IL-2, IL-3, IL-6, IL-12, IL-11, stem cell factor, G-CSF, GM- 
CSF, Stl factor, MCGF, LIF MlP-la and EPO. These compounds may be used alone or in any 

20 combination, and preferred concentration ranges may be readily determined from the published 
art. When murine stem cells are cultured, a preferred non-limiting medium includes mIL-3, 
mIL-6 and mSCF. Other molecules can be added to the culture media, for instance, adhesion 
molecules, such as fibronection or RetroNectin™ (commercially produced by Takara Shuzo Co., 
Otsu Shigi, Japan). 

25 In vitro systems for measurement of mammahan stem cell activity include the long-term 

culture initiating cell assay (LTCIC) and the cobblestone-area-forming cell (CAFC) assay. 
(Pettengell, et al., Blood 84:3653 (1994); Breems et al, Leukemia 8:1095 (1994); Reading, et 
al, Exp. Hem. 22:786 (Abst # 406) (1994); and Ploemacher, et al., Blood 74:2755 (1989)). In 
the CAFC assay a sparsely plated cell population is simply tested for its ability to form distinct 

30 clonal outgrowths (or cobblestone areas) on a stromal cell monolayer over a period of time. This 
assay gives frequency readouts that correlate with LTCIC and are predictive of engraftment in in 



22 



vivo assays and patients. A particularly preferred CAFC assay is described in Young, et al., 
Blood 88:1619 (1996). Vlow cytometry can be used to subset hematopoietic cells from various 
tissue sources by the surface antigens they express. A combination of these assays may be used 
to test for hematopoietic cells or stem cells. 

5 In one preferred embodiment the invention concems a method of identifying genetically 

modified mammalian cells comprising introducing a nucleic acid sequence encoding a mutated 
protein-tyrosine kinase receptor (PTKR) as a selective marker and operatively linked to an 
expression control sequence into a target cell to form a genetically modified cell; allowing 
expression of the mutated PTKR in the genetically modified cell; and identifying said 

10 genetically modified cell expressing the mutated PTKR. hi a preferred embodiment the selective 
marker is a mutated epidermal growth factor receptor (EGFR) family member, particularly 
preferred is a mutated EGFRl and more specifically the sequences designated EGFRl-I and 
EGFRl-IL A polynucleotide or nucleic acid sequence is said to "encode" a polypeptide if, in its 
native state or when manipulated by methods well known to those of skill in the art it can be 

15 transcribed and/or translated to reproduce a polypeptide or fragment thereof A construct or 
vector including a mutated PTKR of the invention may be introduced into the target population 
by any means of genetic transfer or modification known in the art. 

The term "genetic modification" refers to any addition, deletion or disruption to a cells 
normal nucleotides and the methods of genetic modification are intended to encompass any 

20 genetic modification method of introducing nucleic acid sequences encoding the selective 
markers according to the invention and including but not limited to heterologous or foreign 
genes into mammahan target cells. These techniques are generally known. The term introducing 
is broadly used herein, and includes for example inserting. The term genetic modification 
includes but is not limited to transduction (viral mediated transfer of host DNA from a host or 

25 donor to a recipient, either in vivo or ex vivo), transfection (transformation of cells with isolated 
viral DNA genomes), liposome mediated transfer, electroporation, calcium phosphate 
transfection and others. Methods of transduction include direct co-culture of cells with producer 
cells (Bregni, et aL, Blood 80:1418-1422 (1992)); culturing with viral supernatant alone, with or 
without appropriate growth factors and polycations (Xu, et al., Exp. Hemat. 22:223-230 (1994), 

30 and spinoculation. 
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In a preferred embodiment the mutated PTKRs of the invention and particularly the 
mutated EGFR markers are introduced into target cells by transduction with a retroviral vector 
as previously described. The host cell range that may be infected is determined by the viral 
envelope protein. The recombinant virus can be used to infect virtually any other cell type 

5 recognized by the env protein provided by the packaging cell, resulting in the integration of the 
viral genome in the transduced cell and the stable incorporation of the foreign gene product. In 
general, murine ecotropic env of MoMLV allows infection of rodents cells, whereas 
amphotropic env allows infection of rodent, avian and primate cells including human cells. 
Amphotropic packaging of cell lines for use with retroviral systems are known in the art and are 

10 commercially available. These include but are not limited to, PA12, PAS 17, v|/CRIP, and 
FLYA13. (See Miller, et al., Mol. Cell Biol. 5:431-437 (1985); Mill, et al., Mol. Cell Biol. 
6:2895 -2902 (1986); and Danos, et al. (1988) Proc. Natl. Acad. Sci. USA 85:6460-6464. 
Recently, the G-glycoprotein from vesicular stomatitis virus (VSV-G) has been substituted for 
the MoMLV env protein. (See Bums, et al, Proc.Natl. Acad. Sci. USA 90:8033-8037 (1993); 

15 and W092/14829). Xenotropic vector systems also exist which allow infection of human cells. 

Once the target cells are genetically transformed with a nucleotide sequence including 
the mutated PTKJR as a marker, and optionally a heterologous gene, the modified cells 
expressing the mutated PTKR selectable marker may be identified by numerous techniques. The 
term "identify or "identification" used herein in reference to genetically modified cells, unless 

20 otherwise indicated means to mark, to purify, to enrich, to select, to isolate, or to separate. 
Identification can be by a single or multiple steps. 

In a preferred embodiment, the genetically modified cells expressing the selective 
marker according to the invention are identified by an antibody that specifically recognizes and 
binds to the selectable marker, and particularly an antibody that recognizes a mutated EGFR. 

25 This kind of antibody has been described by O'Rourke, et al, Oncogene, 16:1197-1207 (1998). 
A secondary antibody may then be used to fiuther identify or select the antibody coated cells, if 
the secondary antibody is coupled to either a fluorophore or immuno-magnetic beads. The 
marker gene expressing cells may then be selected by flow cytometry or a by using a magnet to 
select bead-coated cells. 

30 The techniques used for the identification of genetically modified cells expressing a 

selective marker of the invention include those described above and other well known 
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techniques including but are not limited to immunoselection; nucleotide detection by northern 
blots wherein RNA bound to a solid support is analyzed for binding to a liquid phase DNA; 
nucleotide detection by southern blots, wherein genomic DNA bound to a solid support is 
analyzed for binding to a liquid phase DNA; PGR amplification of genomic DNA; protein 

5 detection by western blots whereby protein bound to a solid support is analyzed for binding to a 
liquid phase antibody; reverse transcription of mRNA and amplification with PGR; and FISH 
wherein chromosomes are analyzed by Fluorescence in situ hybridization with a hquid phase 
DNA (Lawrence, et al, Science, 249(4971):928 -932 (1990)). 

The above listed methods are not described in detail herein; they are well known by 

10 those of ordinary skill in the art. In brief, antibodies may be obtained by methods well known 
and reference is made to Harlow, et al., "Antibodies: A Laboratory Manual: (1988), 
Biosupplynet Source Book, (1999) Gold Springs Harbor Laboratory. Either polyclonal 
antibodies that are reactive to the antigen of interest may be used or monoclonal antibody 
producing cell clones may be generated. According to the invention the antibody must recognize 

15 the extracellular domain of the mutated PTKR selective marker. More particularly if parts of the 
extracellular domain are modified, by for example deletion, the antibody should recognized an 
epitope of the remaining amino acid sequence of the mutated PTKR. 

Monoclonal antibodies to EGFR are available commercially. Some sources include 
Galbiochem (GA), Pharmingen (GA), Becton Dickinson (GA), and the American Type Gulture 

20 GoUection (ATGG) (Virginia). The antibody may be identified and assayed in vitro by a range 
of methods including gel diffusion, immunoassay, immunoelectrophoresis and 
immunofluorescence. Once genetically modified cells are labeled they can be incubated with an 
antibody against the mutated receptor. 

The genetically modified cells may be physically separated by the use of antibodies, 

25 flow cytometry including fluorescence activated cell sorting (FAGS), and bead selection. (U.S. 
Pat. No. 5,011,912). 

In FAGS an antibody that recognizes the mutated PTKR may be used to identify the 
cells that express the PTKR. This primary antibody can be conjugated to a fluorophore, such as 
fluorescein isothiocyanate (FITG), phycoerythrin (PE), cy-chrome (GyG), allophycocyanine 
30 (APG), tricolor (TG) or Texas Red (TX). If the primary antibody is not conjugated to a 
fluorophore a secondary antibody that is conjugated to a fluorophore may be introduced into the 
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cell sample containing cells which have expressed the mutated PTKR and which are recognized 
by the primary antibody. The primary antibody is attached to the mutant PTKR. The secondary 
antibody binds to the primary antibody and cells having the secondary antibody can be induced 
to fluoresce. Separation may be achieved by the fluorescence activated cell sorter. 

5 The genetically modified target cells expressing a mutated PTKR, and optionally a 

second nucleic acid sequence encoding a protein of interest may be expanded, either prior to or 
after identification and selection, by culturing the cells for days or weeks in appropriate culture 
media, with or without supplements by means well known in the art. (Freshney supra, Celis 
supra, and Coligan et al, supra), 

10 The genetically modified marked cells obtained according to the methods of the 

invention may further be used in an autologous or allogeneic setting wherein the genetically 
modified target cells, preferably hematopoietic cells are expanded and then used in gene therapy 
for example in bone marrow transplantation, graft faciUtation, or immune reconstitution. The 
expanded cells expressing the mutated cell surface marker may be infused into a subject. 

15 Samples may be taken and then tested for the selectable marker by FACS analysis, PGR, or 
FISH, as referenced above, to determine the persistence of the marked cells and further to assess 
efficiency of transduction. 

The invention generally described above will be more readily understood by reference to 
the following examples, which are hereby included merely for the purpose of illustration of 

20 certain embodiments of the present invention and are not intended to limit the invention in any 
way. 

Example 1: 

25 A. Isolation of human EGFR cDNAs: 

EGFR cDNA is isolated by PGR from cDNA that has been generated from colorectal 
adenocarcinoma: SW480 cell line (Marathon cDNA, Glontech (GA); Leibowitz, et al, Gancer 
Research 36:4562-4569 (1976). The following primers are used in the described methods and 
were obtained from Life Technologies (MD): 

30 

5' Primers: 
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EGFRl CTA GGC TAG CAT GCG ACC CTC CGG GAC GGC C SEQ ID NO. 3 
EGFR2 CTC TGC CCG GCG AGT CGG GCT GAC AGC TAT GAG ATG GAG GAA SEQ ID 
NO. 4 



5 3' Primers: 

EGFR3 TTC CTC CAT CTC ATA GCT GTC AGC CCG ACT CGC CGG GCA GAG SEQ ID 
NO. 5 

EGFR2220RGGA TAT CCT ACG TGC GCT TCC GAA CGA TGT G SEQ ID NO. 6 



10 B. Generation ofEGFRl-I: 

The primers EGFRl and EGFR2220R are used to generate an intracellular deletion 
mutant of EGFR (Figures 2 and 3) from the SW480 adenocarcinoma cell line cDNA. Amino 
acid residues 679 - 1210 of the sequence in Figure 2 (SEQ ID NO. 2) are deleted, and this 
mutant EGFR is designated EGFRl -L 

15 The 5' primer EGFRl covers the start codon of the EGFR. The 3'-primer EGFR2220R 

contains a stop codon in place of amino acid 679 of the SEQ ID NO. 2 (Figure 2). Using 
primers EGFRl and EGFR2220R results in the amplification of the EGFR sequence that has a 
deletion of amino acids 679 to 1210. 

The following PGR reaction is performed: Marathon cDNA (- 0.5 ng) is mixed with 

20 Advantage cDNA buffer (10 mM Tris-HCl (pH=7.5 at 42^C), 50 mM KCl, 2.5 mM MgCb, 
0.001% Gelatin, 2.5 |Limol dATP, 2.5 |Limol dCTP, 2.5 i^mol dGTP, 2.5 |amol dTTP), 0,01 OD260 
primer EGFRl, 0.01 OD260 primer EGFR2220R, 1 \i\ Advantage cDNA polymerase (Life 
Technologies; MD), 5U Pfu Turbo polymerase (fi*om Pyrococcus furiosus) (Promega; WI) and 
water in a final volume of 50 |iiL PGR is performed as follows: Cycle 1: 95^C for 5 min, Cycle 

25 2-15: 95°C for 1 min, 60^C for 1 min, 68^C for 4 min, and Cycle 16: eS^'C for 10 min. 

The reaction is then cooled to 4°C in the PGR machine and subsequently the amplified 
cDNA is ethanol precipitated with 0.3 M sodium acetate. The pellet is washed once with 70% 
ethanol, dried and resuspended in 50 lal H2O. 

1 \A of the above PGR reaction is then reamplifiedj and the reaction mix contains for the 

30 second round of amplification in addition to 1|li1 of the above PGR reaction: Pfij buffer (20mM 
Tris-HCl (pH8.8), 2 mM MgS04, 10 mM KCl, 10 mM (NH4)2S04, O.l % Triton X-100, 0.1 
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mg/ml BSA), 2.5 |Limol of each dNTP (dATP, dTTP, dCTP, dGTP), 0.01 OD260 primer EGFRl, 
0.01 OD260 primer EGFR2220R, 5 U Pfu Turbo Polymerase and water in a final volume of 50 
^il. PGR is performed as follows: Cycle 17: 95^C for 5 min, Cycle 18^8: 95°C for 1 min, 60^C 
for 1 min, 72°C for 4 min, and Cycle 49: 72^ for 10 min. 

5 The reaction is cooled to 4°C in the PGR machine and the amphfied cDNA is ethanol 

precipitated with 0.3 M sodium acetate. The pellet is washed once with 70% ethanol, dried and 
resuspended in 20 \xl H2O. The PGR reaction is loaded on a IxTAE gel. A band with the size 
of approximately 2200 bp is isolated fi-om the gel and cloned into the Srfl restriction site of 
pPGR-Script Amp vector (Stratagene; GA) according to the manufacturer's protocol. The 

10 resulting vector is called pPCR-Script EGFRl-L The correctness of the subcloned PGR product 
is confirmed by restriction analysis and sequencing according to well known methods. 



15 G. Generation of EGFRl-II: 

EGFRl-II contains the same deletion in the intracellular domain as EGFRl-I (a stop 
codon in position 679). In addition, amino acids 25 to 312 are deleted in the extracellular 
domain of EGFR depicted in Figure 2 (SEQ JD NO. 2). The sequence of the signal peptide, 
amino acid 1 to 24, is fused to amino acid 313 of the EGFR extracellular domain, hi order to 

20 delete the EGFR extracellular domain, a protocol is used that is described in fiirther detail 
below. (Also see White, ed. Methods in Molecular Biology, Vol 15. PGR Protocols, Chapter 25 
(1993)). This method is known in the art as "gene splicing by overlap extension (gene 
SOEing)". (Figure 4) 

Primer pairs EGFR1/EGFR3 and EGFR2/EGFR2220R are used to amphfy 2 PGR 
25 products (a) and (b). Primer EGFRl encodes amino acid 1 to 7, and Primer EGFR3 encodes 
amino acid 18 to 24 fixsed to the nucleotide sequence of amino acid 313 to 319. Using the 
EGFRl and EGFR3 primers a PGR product is generated that encodes amino acid 1 to 24 which 
is fixsed to amino acid 313 to 319. 

Primer EGFR2 encodes the same amino acid acids as Primer EGFR3 only in the reverse 
30 orientation. Primer EGFR2220R encodes a stop codon in place of amino acid 679. As 
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described in section B above, this results in the deletion of the intracellular domain after amino 
acid 678. 

Using these two primer pairs, two PGR products (a) and (b), are obtained each having a 
42 nucleotide of overlapping sequence encoding amino acid 18 to 24 and amino acids 313 to 
5 319 (Figure 4 - PGR reaction 1). 

In the second PGR reaction, primers EGFRl and EGFR2220R are used with the 
intermediate PGR products (a) and (b). When these intermediate PGR products are mixed, 
denatured and reannealed, one of the strands of the two PGR products can overlap at their 3' 
ends, and act as primers on one another to make the mutant product. Subsequently, the mutant 
10 PGR product (c) is amplified with the primers EGFRl and EGFR2220R. The resulting PGR 
product encodes EGFRl -II amino acid 1 to 24 of the wt EGFR fused to amino acids 313 to 678 
of the wt EGFR with a stop codon in position 679 (Figure 4). 

The PGR reactions are as follows: The first PGR reaction in: a.) 0.01 OD260 of EGFRl 
and 0.01 OD260 of EGFR3 and in b.) 0.01 OD260 EGFR2 and 0.01 OD260 EGFR2220R are 
15 mixed with -0.4 ng Marathon cDNA, Ix Advantage cDNA Polymerase buffer (as described in 
B), 1 iLil Advantage cDNA polymerase, 2.5 |imol of each dNTP, 5U Pfu Polymerase (Promega) 
and H2O in a final volume of 50 ^il. The PGR is performed as follows: Gycle 1 : 94^G for 5 min, 
Gycle 2-16: 94T for 0.5 min, 60^G for 1 min, 68^G for 7 min, and Gycle 17: 68''G for 10 min. 
The reactions are cooled to 4°G in the PGR machine and the amplified PGR products are 
20 ethanol precipitated with 0.3 M sodium acetate. The pellets are washed once with 70% ethanol, 
dried and resuspended in 50 |li1 H2O. 

5 |Lil of each PGR reaction (a) and (b) are then further amplified. The reaction mix 
contains for the second round of amplification, in addition to each PGR reaction (a) O.Ol OD260 
of primers EGFRl and EGFR3 or (b) 0.01 OD260 of primers EGFR2 and EGFR2220R: 1 x Pfii 
25 buffer, 2.5 jumol of each dNTP, and 5 U Pfu Polymerase (Promega; WI) and H2O in a final 
volume of 50 |li1. The PGR is performed as follows: Gycle 18: 94''G for 5 min, Gycle 19-49: 
94^G for 0.5 min, 60''G for 1 min, 72''G for 6 min and Gycle 50: 72^G for 10 min. 

The reactions are then cooled to 4^G in the PGR machine and the amplified PGR 
products are phenol/chloroform extracted and ethanol precipitated with 0.3 M sodium acetate. 
30 The pellet is washed once with 70% ethanol, dried and resuspended in 20 |il H2O. 
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In a second PGR reaction, PGR product (a) and (b) are mixed in equimolar amounts and 
reamplified with Pfu Turbo Polymerase (Stratagene) using the primer pair EGFRl and 
EGFR2220R. The PGR mix is the same as described for cycles 18-50. The PGR is performed 
as follows: Gycle 1: 95X for 5 min, Gycle 2 - 32: 95''C for 1 min, 60^G for 1 min, 72''G for 4 
5 min, and Gycle 33: 72°G for 10 min. The PGR is then loaded on a gel (as described above) and a 
band of --1200 bp is isolated. This PGR product is cloned into the Srfl site pPCR-Script 
(Stratagene; GA) according to the manufacturer's protocol The resulting vector is called pPGR- 
Script EGFRl -11. The correctness of the subcloned PGR product (c) is confirmed by restriction 
analysis and sequencing according to well known methods. 

10 

D. Generation of retroviral vectors containing mutated EGFRs and viral supernatants: 
EGFRl -I and EGFRl -II sequences are excised from pPGR-Script EGFR-I and pPGR-Script 
EGFR-II using the restriction sites NotI and Xhol and are cloned into the multiple cloning site 
of the Murine Leukemia Virus (MoMLV) based retroviral vector pGla (GTI, Maryland) - 

15 (pGla-mutated sequence - IRES- nerve growth factor receptor (NGFR)), which has been cut 
with NotI and Xhol. The retroviral vectors are designated pGlaEGFRl-I and pGlaEGFRl-H. 

The constructs are cotransfected into human embryonic kidney cells 293T (293T cells) 
(Gary Nolan, Stanford) with an envelope construct pGiGL that permits expression of the 
Vesicular Stomatitis Virus G-Protein (VSV-G envelope) under the control of the 

20 cytomegalovirus (GMV). Also cotransfected into 293T cells is the packaging construct pGiGP 
(encoding MoMLV gag-pol under the control of the GMV promoter ) using the GaGla technique 
(Glontech; GA). (WO 97/21825 and Rigg, et al., Virology 218:290-295 (1996)) 

Viral supernatants are collected 24, 48, and 72 hours after transfection, centrifuged at 
1200 rpm in a Beckman GS-6KR centrifuge to remove particulate material, and either used 

25 immediately to transduce cells or frozen in a dry ice/methanol bath. The viral supernatants are 
used to transduce the packaging cell line ProPak-A-6 (PPA-6) (Systemix, Inc.). The PPA-6 cell 
line is a derivative of 293T cells expressing the MLV amphotropic envelope and MLV gag/pol 
under the control of the GMV promoter (Rigg et al. supra). The PPA-6 cells are sorted by 
immuno-magnetic bead selection. Alternatively, if the cells only expressed low amounts of 

30 mutated EGFR, the cells are sorted by FAGSorting as described in detail below. Supematants 
from PPA-6 cells are collected on day 2, 3 and 4 after transduction and treated as described for 
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293T cells. The generated supematants of PPA-6 cells contain recombinant viral particles that 
have the amphotropic envelope and are used to transduce human primary cells and cell lines. 

E. Tissue culture and cell lines: 

5 The following cell lines and primary cells are used: (a) human T cell line, CEMSS 

(Frederico, et al, J.Biol. Regul. Homeost Agents, 7: 41-49 (1993)), (b) human embryonic 
kidney cells 293T (293T) (Pear, et al, Proc. Natl. Acad. Sci. USA 90:8392 - 8396 (1993)), and 
(c) PPA-6 (Rigg, et al., supra). Human primary T cells are obtained by isolating the 
mononuclear cell fraction from human blood using FicoU-gradient centrifugation (Noble and 

10 Cutts, Can. Vet. J. 8: 110 - 111 (1967) and Boyle and Chow, Transftision 9:151 - 155 (1969). 
Himian blood (buffycoats; Pietersz, et al, Vox Sang 49: 81 - 85 (1985) and Pietersz, et al., Blut. 
54:201 - 206 (1987) obtained from the Stanford Bloodbank are diluted 1:1 with phosphate 
buffered saline (PBS). In 50 ml tubes, 15 ml blood are overlaid onto 21 ml of Ficoll (Isoprep; 
Robbins Scientific, Cahfomia). The gradient is spun for 30 minutes at 1700 rpm in a Beckman 

15 GS-6KR at room temperature and stopped without using the break. Peripheral blood 
mononuclear cells (PBMCs) are collected from the interface of the Lymphoprep gradient. The 
resulting PBMCs are fiirther purified on a second Ficoll gradient to remove remaining blood 
cells. The PBMCs are incubated for 1 hour in a tissue culture flask in tissue culture media for 
PBMCs (see below) at 37^C in 5% CO2 allowing adherent cells such as macrophages to attach 

20 to the tissue culture flask. Non-adherent cells (T/B/NK cells) are used to purify CD4"^ T cells. 
Cells (2 X 10^) are incubated with 300 |j,1 anti-CD4 antibody for 1 hour at 4°C. The cells are 
washed 3 times with PBS arid incubated with 1x10^ anti-mouse IgG coupled immuno-magnetic 
beads (Dynal, Oslo) for 1 hour at 4°C. The CD4^ cells that bound the CD4-antibodies and the 
secondary antibody coupled magnetic beads are positively selected by incubating the cells for 

25 10 minutes on a Dynal magnet. After removing the unbound cells, the remaining cells are taken 
off the magnet and put into culture (see below). Usually the beads remain for up to 10 days on 
the cells. 

CD34'' cells are isolated from G-CSF mobilized peripheral blood (MPB) using Isolex 
300SA or 3001 (Baxter, IL) (Systemix, CA). The cells are approximately 80 - 90% pure. 
30 Cells are cultured in a Steri-Cult 200 incubator (Forma-Scientific) at 5% CO2. Media 

(DMEM, Iscove's medium RPMI), PBS, and sodium pyruvate are obtained from JRH 
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Biosciences (California), FBS from Hyclone (Utah), L-glutamine, Trypsin and MEM vitamins 
from Life Technologies (Maryland), ITS (insulin/transferrin/sodium selenite), PHA 
(phythemaglutinin), Interleukin-2 (11-2) from Sigma (Missouri). 

The cells are cultured in DMEM, 10% FBS, 1% sodium pyruvate, 1% L-glutamine (293 T, 
5 PPA-6); RPMI, 10% FBS, 1% sodium pyruvate, 1% L-glutamine (CEMSS), Iscove's medium 

10% FBS, 1% L-glutamine, 1% ITS (insulin/transferrin/sodium selenite, 0.5 mg/ml stock), 1% 

MEM vitamins (human PBMCs). CD4'^ T cells are stimulated every 10-12 days with 1-2 |ig/ml 

PHA, irradiated feeder cells and 40 U/ml 11-2. 

Irradiated feeder cells are generated as follows: PBMCs are isolated as described above 
10 and irradiated with 3500 rads. The cells are mixed at a 10:1 ratio with the B cell lymphoma JY 

(Barbosa, et aL, Proc. Natl. Acad. Sci. 81:7549-7553 (1984)) that had been irradiated with 6000 

rads. 

To obtain T-cells in a resting state, 4-5 days after stimulation T cells are transferred into 
fresh medium that contained 20 U/ml 11-2. CD34"^ cells are grown in X vivo 15, 50 ng/ml 
15 thrombopoietin) (TPO; R&D Systems, MN), 100 ng/ml flt-3 Ugand (FL), 100 ng/ml steel 
factor (SF). (SyStemix, Inc.) 

In order to passage adherent cells, the cells are washed once with PBS, then trypsinized 
for 5 min and subsequently split into new tissue culture flasks (VWR, New Jersey). 

20 F. Transduction of PPA-6. human T cell lines, primary T cells and CD34^ cells: 

\(f cells/ml from step (E) are transduced with 1-3 ml of viral supernatant, that has been 
either generated from 293 T cells or PPA-6 cells, by spinoculation with 8 |ag/ml protamine 
sulfate (Sigma, Missouri). Spinoculation is done at ZTC for 3 hrs at 3000 rpm (CD34"^ cells, 
PPA-6) or at 2750 rpm (human T cells, CEMSS). PPA-6 cells are transduced in 6 well plates, 

25 non adherent cells in 6 ml tubes (VWR). Hiunan T cells and CD34^ cells are activated for 2 
days prior to the transduction procedure with PHA/IL-2 or TPO/FL/SF, respectively. 

G. FACS analysis. FACSsortins: and immuno-magnetic bead selection of cells expressing 
EGFRU and EGFRl-II: 

30 The following antibodies and reagents are used for staining. CD4-FITC (Caltag; CA), 

anti-CD34-APC (Becton Dickinson; NJ), Thy-l-PE (Becton Dickinson), propidium iodide (PI) 
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(Sigma), unconjugated anti-EGFR antibody (GROl) (Calbiochem; CA), goat anti-mouse IgG- 
PE/FITC (Caltag), goat anti-mouse IgG2a-PE/FITC (Caltag), and anti-mouse IgG coupled 
magnetic beads (Dynal; Oslo). The anti-bodies and immuno-magnetic beads were used 
according to manufactiiter's protocol. 1x10^ cells are stained in 50|j,l of PBS/2% FBS for 20 to 

5 60 minutes at 4^C. When a secondary antibody is used the cells are washed once with 2 ml of 
PBS/2%FBS, then incubated again in 50 jil PBS/2%FBS and the secondary antibody is added. 
Before FACS analysis, the cells are again washed once with PBS/2%FBS, centrifuged and then 
resuspended in 500 |al PBS/2%FCS containing l|ig/ml PL FACS analysis is performed on a 
FACSscan (Becton-Dickinson Immunocytometry Group; CA) according to manufacturer's 

10 instructions. 

Cells that express the marker genes, EGFRl-I and EGFRl-H are sorted as follows: 2 x 
10^/ml PBS/2%FBS cells are stained with an anti-EGFR antibody (at least with l|il antibody/10^ 
cells; GROl). The cells are washed twice with PBS/2%FBS and subsequently incubated with a 
fluorophore (FITC, PE) conjugated rat anti-mouse IgG antibody (5 ^xl/10^ cells) that recognizes 
15 the anti-EGFR antibody. The cells are also stained with PI to distinguish between dead and live 
cells. Ix 10^ cells/ml are sorted for EGFR-positive and Pl-negative cells. This is done on a 
FACSstar Plus (Becton-Dickinson Immunocytometry Group) according to manufacturer's 
protocol. 

To isolate cells by bead selection, the cells are stained with an anti-EGFR antibody. The 
20 lO^cells/ml are incubated with lOjal/ml anti-EGFR antibody (GROl) in PBS/2%FBS for 1 hr on 
ice with occasional shaking. The cells are washed 3 times with PBS/2%FCS and then anti-IgG 
antibody coupled magnetic beads (Dynal; Oslo) that can recognize the anti-EGFR antibody are 
added (~ 5 beads per positive cell). Again the cells are incubated for 1 hr on ice. The cells that 
express EGFR are selected by positive selection with a Dynal magnet (Dynal; Oslo) for 10 min. 
25 The unbound cells are removed and the EGFR expressing cells are put into culture (see section 
E.). 

Figure 5 illustrates the expression of EGFRl-II CD34^ cells after transduction of these 
cells with supematants that are generated from PPA-6 cells. Primary human T cells are 
transduced with PPA-6 supematants and are selected by immuno-magnetic beadselection then 
30 sorted using a FACStar Plus (data not shown). The cells are enriched from 16 % to 92 %. 



33 



Example 2: 

A. Isolation of human MuSK-R cDNAs: 

MuSK-R is isolated by PGR from fetal skeletal muscle cDNA (Marathon cDNA, Invitrogen) 
5 using primers flanking the 5 ' and 3 ' of the MuSK-R cDNA. 

The following primers obtained from Operon Technologies, Inc. are used to amplify MuSK- 
RcDNA: 

MuSK21FN: CGT CCT GCG TGA GCC TGG ATT AAT C SEQ ID NO: 9 

MuSK34FN: GCC TGG ATT AAT CAT GAG AGA GCT C SEQ ID NO: 10 

10 MuSK2666RN: CGA GGC CTG TCT TCA ACC TTA GAC 

ACT CAC AGT TCC CTC TGC SEQ ID NO: 1 1 

The 5' primer MuSK21FN covers 25nucleotide (nt) before the start codon, the second 5' 
primer MuSK34FN covers the start codon (aa 1) of MuSK-R and surrounding sequence. The 
3 '-primer MuSK2666RN covers the stop codon of MuSK-R and surroxmding sequence. Using 
1 5 primers MuSK21FN, MuSK34FN and MuSK2666RN results in the ampUfication of a DNA 
fragment of - 2600 bp that encodes a MuSK-R wt protein. 

The following PCR reaction is performed: Marathon cDNA 2 ng) is mixed with 
Advantage cDNA buffer (10 mM Tris-HCl (pH-7.5 at 42^C), 50 mM KCl, 2.5 mM MgCl2, 
0.001% Gelatin), 2.5 |imol dATP, 2.5 |imol dCTP, 2.5 ^imol dGTP, 2.5 j^mol TTP), 1 |ig primer 
20 MuSK21FN, 1 |Lig primer MuSK2666RN, 1 |li1 Advantage cDNA polymerase, and water in a 
final volume of 50 |il. The PCR is performed as follows: Cycle 1 : 94''C for 5 min, Cycle 2-11: 
94^C for 0.5 min, 63^C for 1 min, 68^C for 6 min, and Cycle 12: 68T for 10 min. 

The reaction is cooled to 4^C in the PCR machine, and the amphfied cDNA is ethanol 
precipitated with 0.3 M sodium acetate. The pellet is washed once with 70% ethanol, dried and 
25 resuspended in 100 jil H2O. 

10 |al of the above PCR reaction is then reamplified. The reaction mix contains for the 
second round of ampUfication step in addition to 10 |li1 of the above PCR reaction: Pfti buffer 
(20mM Tris-HCl (pH8.8), 2 mM MgS04, 10 mM KCl, 10 mM (NH4)2S04, 0.1 % Triton X-100, 
0.1 mg/ml BSA), 2.5 |Limol of each dNTP (dATP, dCTP, dGTP, dTTP), 1 ^ig primer 
30 MuSK34FN, 1 |ig primer MuSK2666RN, 5 U Pfii Turbo Polymerase (fi:om Pyrococcus 
fliriosus) and water in a final volume of 50 |liL The PCR is performed as follows: Cycle 13: 
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94T for 5 min, Cycle 14-^3: 94T for 0.5 min, 62X for 1 min, 72^C 6 min, and Cycle 44: 12X 
for lOmin. 

The reaction is cooled to 4^C in the PCR machine and the ampHfied cDNA is ethanol 
precipitated with 0.3 M sodium acetate. The pellet is washed once with 70% ethanol, dried and 

5 resuspended in 20 |al H2O. The PCR reaction is loaded on a IxTAE gel. A band with the size 
of - 2600 bp is isolated from the gel and cloned into the Srfl restriction site of pPCR-Script 
Amp vector (Stratagene, CA) according to the manufacturer's protocol. The resulting vector is 
called pPCR-Script MuSK-R-wt. The correctness or the subcloned PCR product is confirmed 
by restriction analysis and sequencing by methods well known in the art. (The nucleotide 

1 0 sequence is illustrated in SEQ ID N0.7) 

B. Generation of mutations in the intracellular domain of MuSK-R by PCR: 

The primers MuSK1380F, MuSK1657R, and 1747R are used to generate intracellular 
deletion mutants of MuSK-R from the plasmid pPCRScriptMuSK-R. The primer sequences are 
1 5 as follows wherein p means phosphorylated: 

Primer 1380F: 5' pCG GCC TGT GCC AGA CTG CCA CAT CTA G (SEQ ID NO: 12); 
Primer 1657R: 5' pCG TCT AGG TGA GGG TTA CTG CTG 

CTG ATT CTC (SEQ ID NO: 1 3) 

20 Primer 1 747R: 5^ pGG TTA ACC CTA TTC AAT GTT ATT 

CCT TGA ATA CTC CAG (SEQ ID NO: 14). 

Using primer pair MuSK1380F and 1657R results in the deletion of amino acid residues 
538 - 879 of MuSK-R, using primer pair MuSK1380F and 1747R results in the deletion of 

25 amino acid residues 577 - 879. The two mutant forms of MuSK-R are designated MuSK- 
RA538-879 (MuSK-RI) and MuSK-RA577-879 (MuSK-RII). In both MuSK-RI and MuSK-RH 
most of the intracellular domain of MuSK-R as shown in Figure 6 is deleted. While not meant 
to limit the invention in any manner, it is believed that both truncations result in a deletion of the 
kinase domain and most of the substrate binding motifs of the wt MuSK-R illustrated in Figure 

30 6. 
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The 5' primer MuSK1380F covers the nucleotide sequence 1333-1410 of the MuSK-R. 
The 3'-primers MuSK1657R and 1747R contain stop codons in place of amino acid 538 and 
577 of MuSK-R. Using primer MuSK1380F with MuSK1657R or MuSK1747R results in the 
amplification of MuSK-R nucleotide sequence 1333 to 1614 that has a stop codon in the 

5 position of amino acid 538 or nucleotide sequence 1333-1728 that has a stop codon in the 
position of amino acid 577, respectively. The PGR reaction includes -10 ng hMuSK-R wt 
DNA, 1 X Pfli buffer, 1 [xg of primer MuSK1380F and either 1 ^ig primer MuSK1657R or 
MuSK1747R, 2.5 |amol of each dNTP, 5 U Pfu polymerase and H2O in a final volume of 50 \xl 
The PGR reaction is performed as follows: Gycle 1: 95°C for 5 min, Gycle 2-31: 95''C 0.5 min, 

10 eO'^C for 1 min, 72T for 4 min, Cycle 32: 72''C for 10 min. The PGR reaction is cooled to 4^G 
in the PGR machine and then loaded on a 1 x TAE gel. 

The two PGR products MuSK-RI (nt 1380-1614) and MuSK-RII (nt 1380 - 1728) are 
cloned into the Srfl site of pPGR-ScriptAmp (Stratagene) according to the manufacturer's 
protocol. As the 5' coding sequence from MuSK (nt 1 - 1379) is missing in these constructs, 

15 this sequence is excised from the plasmid pPGR-Script MuSK-wt using restriction sites Nael 
and Aatll. The two pPGR-Script vectors containing the modified MuSK sequence nt 1- 1614 
and 1 - 1726 are called pPCR-Script-MuSK-RI and pPCR-Script MuSK-RII, respectively. The 
correctness of the vectors are confirmed by restriction analysis and sequencing by methods well 
known in the art. 

20 

G. Generation of retroviral vectors containim mutated MuSK-Rs and viral supernatants: 

Wild-type and mutant MuSK-R are excised from pPGRScriptMuSK-Rwt, 
pPGRScriptMuSK-RI and pPGRScriptMuSK-RII using the NotI and Xhol site and are cloned 
into the multiple cloning site of the Moloney Murine Leukemia Virus (MoMLV) based 
25 retroviral vector pGla (GTI, Maryland) which is cut with NotI and Xhol. The retroviral vectors 
are designated pGlaMuSK-R, pGlaMuSK-RI and pGlaMuSK-RQ. The constructs 
pGlaMuSK-R, pGlaMuSK-RI and pGlaMuSK-RII were used to generate viral supernatants as 
described in Example 1. 

30 D. Tissue culture and cell lines: Experiments were carried out the same way as described in 
Example 1. 
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E. Transduction of PPA'6 and human T cell line: Experiments were carried out the same way 
as described in Example 1 

5 F. FACS analysis and immuno-magnetic bead selection of cells that express MUSK-RI and 
MUSK'RII: 

In addition to the antibodies that are listed in Example 1, anti-MuSK-R polyclonal serum, 
and anti-MuSK-R hybridoma supernatant (see section G) were used for flow cytometry and 
immuno-magnetic bead selection. . 

10 In order to isolate cells by immuno-magnetic bead selection, the cells are stained with an 

anti-MUSK-R antibody. For this purpose the 10 '^cells/ml are incubated with 1-3 ml anti-MuSK- 
R hybridoma supemantant in PBS/2%FBS for 1 hr on ice with occasional shaking. The cells are 
washed 3 times with PBS/2%FCS and then anti-IgG antibody coupled magnetic beads, that can 
recognize anti-MuSK-R antibodies, are added 5 beads per positive cell). The cells are 

15 incubated for 1 hr on ice. Cells that express MuSK-R are selected by positive selection with a 
Dynal magnet (Dynal, Oslo) for 10 min. The unbound cells are removed and the MuSK-R 
expressing cells are put into culture as described in section D. 

G, Generation of a monoclonal antibody against the extracellular domain of MuSK-R: 
20 To generate monoclonal antibodies against the extracellular domain (XC) of MuSK-R, 

MuSK-R XC is amplified by PCR and cloned into the expression construct pSecTag2b 
(Invitrogen). Cloning the XC domain of MuSK-R into the multiple cloning site (MCS) of the 
plasmid pSecTag2b allows for the expression of the XC under the control of the CMV promoter. 
In addition, the plasmid contains the sequence of a myc and (His)6-tag after the multiple cloning 
25 site, which allows to fuse the protein of interest (MuSK-RXC) to the myc and (His)6-tag. The 
signal peptide of MuSK-R is replaced by the IgK leader. (Figure 3). The extracellular domain of 
MuSK-R without the signal peptide is ampUfied by PCR using the following primers wherein p 
means phosphorylated: 

MuSK 1 16FPC: 5^ pCT TCC AAA AGC TCC TGT CAT CAC C SEQ ID NO: 15 
30 MuSK 1532RC: 5' pCC AGT CAT GGA GTA TGT AGG TGA GAC SEQ ID NO: 16 
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Primer MuSK116FPC starts with the sequence after the signal peptide (nt 69 - 93). 
Primer MuSK1532RC covers the sequence before the transmembrane domain starts and the first 
2 amino acids of the transmembrane domain corresponding to nucleotide sequence 1462-1586 
of SEQ I N0:7. For the PGR reaction ^10 ng hMuSK-R wt DNA are mixed with 1 x Pfu 
5 buffer, 1 \xg of primer MuSKl 16FPC and 1 |ag primer MuSK1532RC, 2.5 lumol of each dNTP, 
5 U Pfu polymerase and H2O in a final volume of 50 |li1. The PGR reaction is performed as 
follows: Gycle 1: 95°G for 5 min, Gycle 2-7: 96'*G for 0.5 min, 60^G for 1 min, 72^G for 6 min, 
and Gycle 8 - 27: 95°G for 0.58 min, 58°G for 1 min, 72^C for 6 min; Gycle 28: 72^G for 10 min. 
The PGR reaction is cooled to 4''G in the PGR machine and then gel-purified. The PGR 

10 firagment is cloned into to the EcoRV restriction site of pSecTag2b. Li this way MuSK-R XC is 
cloned in fi*ame with the Igk leader at the N-terminus and the myc- and (His)6-tag at the G- 
terminus. The resuhing plasmid is called pSecTag-hMuSK-R. 

To express the MuSK-R XG the plasmid pSecTag-hMuSK-R is transfected into 293T 
cells by the CaCh technique (as described in section G). 24 hrs after the transfection, the media 

15 is replaced with either fresh DMEM/10%FBS or serum-free X-Vivo 15. Supematants of the 
transfected cells are collected after 48 and 72 hrs. A total of 400 ml supematants are collected 
and are frozen at -80*^G until the supematants are purified. 

The MuSK-R XG is purified from tissue culture supematants by immobilized metal 
affinity chromatography. The metal ion is 0.1 M NiG^. The column is a 1 or 5 ml Pharmacia 

20 metal HiTrap chelating sepharose column. The equilibration buffer (Buffer A) consisted of 20 
mM Na2HP04 pH 7.4, IM guanidine hydrochloride, IM NaGl, filtered through 0.2 j^M cellulose 
acetate filter. The elution buffer (Buffer B) is 20 mM Na2HP04, pH 7.4, 1 M guanidine 
hydrochloride, IM NaGl, 0.5 M imidazole, filtered through 0.2 |aM cellulose acetate filter. The 
Pharmacia FPLG chromatography system is used to mn columns, with FPLG director program 

25 software and a Pharmacia P50 pump. The purification is performed at 4°G. 

The pump is primed with buffer A before the load is started. Before the column the 
column is attached, the load is pumped through until the pink color of the tissue culture media is 
seen at the connection so that the column is not washed with non-equilibration conditions. 

The tissue culture supematants are adjusted to contain 0.85 M NaGl, IM guanidinium 

30 chloride and 40 mM imidazole and the pH is adjusted to 7.4. The column is equilibrated with 
8% buffer B. The sample is loaded and the column then washed in above conditions for seven 
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column volumes. MuSK-R is eluted at 30% Buffer B (150 mM imidazole) over eight column 
volumes. Fractions are collected from start of the run. 

Each fraction is tested in a Dot Blot Assay (see below). Selected positive fractions are 
tested in Westem Blot assays and Elisa (see below). Positive fractions are pooled and dialyzed 
5 in 10,000 MWCO membrane (Pierce Snakeskin) against PBS. After dialysis, the optical density 
is determined at OD280. The samples are filtered through 0.2 |liM filters and then concentrated 
in Centricon Centriprep 30 devices in a refiigerated Sorvall RT6000D according to 
manufacturer's protocol. 

To test for positive samples in the dot blot assay, 10 jil of each fraction is pipetted on 

10 nitrocellulose. The membrane is dried, blocked with superblock and then probed for MuSK-R 
protein and developed as described for the fridia Westem (see below). 

Westem Blots are performed by methods well known in the art. The materials (sample 
buffer, running buffer and gels) are obtained from Novex. 

For westem blot analysis 25 to 35 jiil/fraction are used. Guanidine containing fractions 

15 are precipitated in ice-cold ethanol and are stored on ice for 15 minutes or overnight at 4^C. 
The samples are centriftiged in refrigerated microcentrifiige (TOMY) at 14,000 rpm for 10 min. 
The supernatant is discarded, ice-cold acetone is added and centrifuged as before. The pellet is 
resuspended in SDS sample buffer (Novex) with 5% p-mercaptoethanol in a final volume of 50- 
70 |j,l. The samples are denatured at > 90''C for 5 minutes, briefly centrifuged, and 25 - 35 |il 

20 loaded on a 4-20% gradient gel. The gel is blotted onto 0.45 |iM nitrocellulose for 1 .4 hours at 
100 volts, using the Biorad wet transfer blotting cassette with tris-glycine-methanol transfer 
buffer (25 mM trizma Base, 192 mM glycine, 20% methanol). After blotting the gel, the blot is 
blocked in Pierce TBS superblock for 10 minutes with mild agitation. The blot is washed twice 
in TBST (50 mM Tris, pH 7.5, 150 mM NaCl, 0.05% Tween 20) for five min per wash on a 

25 rotating platform. The Pierce India^^ His-HRP Probe is diluted to 1:5000 in TBST and the blot 
is incubated with the probe for 1 hour at room temperature and washed 3 times in TBST. After 
that, horseradish peroxidase reagent (Sigma Fast HRP hisoluble Substrate D4418) is added to 
the blot and the blot is developed. The blot is washed in three changes of water to stop 
development. Alternatively an mouse anti-c-myc antibody, (Santa Cruz Biotechnology; CA) is 

30 used to detect recombinant MuSK-R protein. This antibody is diluted in superblock to 1 jiig/ml 
final concentration. The blot is washed three times with TBST and then a goat anti-mouse IgG- 
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HRP antibody (Sigma) is added at 1:5000 dilution in superblock. The blot is incubated for 1 
hour at room temperature, with gentle agitation and developed as described above with Fast 
HRP insoluble substrate (Sigma). The recognized protein traveled at about 85 kD on the SDS 
PAGE, and it is considered to be 19 kD heavier due to glycosylation. 
5 The recombinant MuSK-R protein is injected into 3 different Balb/c mice. For this 

purpose 25-50 |Lig are mixed with 2.25 mg alhydrogel and 100 fig MDP (muranyl dipeptide; 
Pierce) in a final volume of 200 [il and injected 5 times every 14 days subcutaneously. After the 
3 and the 5 injection serum of the 3 mice are tested for reactivity against MuSK-R by FACS 
analysis and EUsa. For FACS analysis the 5x10^ cells of cell lines CEMSS and CEMSSMuSK- 

10 R are used. Both the presera and sera are diluted 1:100. 1:300, 1:900 and 1:2700. A rat anti- 
mouse IgG-PE antibody is used as a secondary reagent at a 1:20 dilution. For the Elisa, 96 well 
plates are coated with 50 j^l of 10 |ug/ml anti-mouse IgGFc (Jackson; Maine) The plates are 
incubated with various dilutions of sera (1:100 to 1:218700), subsequently with MuSK-R 
protein and with Nickel activated horse radish peroxidase at a 1:1000 dilution (HRP, Pierce). 

15 Nickel activated HRP is binding to the recombinant MuSK-R protein via the (His)6 tag. To 
develop the Elisa, the plate is incubated with TMP peroxidase substrate (Zymed; CA). In both 
assays one mouse shows the highest reactivity against native and recombinant MuSK-R. This 
mouse is boosted with a 6^^ injection of 200 p,g MuSK-R protein in PBS. The injection is done 
subcutaneously and intravenously. 1 week later the spleen is removed, lymphocytes isolated 

20 with lympholite M (Accurate Chemicals) and fused, using 50% polyethylene glycol to the 
myeloma cell Hne P3X63AG8.0653 using standard procedures. The resulting hybridomas are 
grown in bulk in HAT media for one week. Viable cells are recovered using lympholite M and 
cultured in HAT media plus cloning factor (Igen). After the hybridoma are grown for another 
week, a batch of the cells are cryopreserved in HAT media plus 10% DMSO. Another batch of 

25 the cells is subdivided into individual clones by FACSsorting using the single cell deposit unit. 
The cells are sorted by forward and side scatter and for PI negative cells. The cells are grown 
up in HT media for two weeks. The supematants are tested by EUsa and FACS (as described 
above) for monoclonal antibodies that can recognize native and recombinant MuSK-R protein. 
The antibodies are isotyped in an Elisa assay by using secondary antibodies that react with 

30 IgGl, 2a, 2b, 3, IgM, k, and X (Caltag, CA). 
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Three monoclonal antibodies are identified HI, H2 and H4. All three can react with 
MuSK-R expressed on the cell ine CEMSS-MuSK-R in a FACS assay. HI is an IgGl, k; H2 is 
IgGl, k; H4 is IgM antibody. 
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cctaagatcc cgtccatcgc cactgggatg gtgggggccc tcctcttgct gctggtggtg 198 0 

gccctgggga tcggcctctt catgcgaagg cgccacatcg ttcggaagcg cacgctgcgg 204 0 

aggctgctgc aggagaggga gcttgtggag cctcttacac ccagtggaga agctcccaac 2100 

caagctctct tgaggatctt gaaggaaact gaattcaaaa agatcaaagt gctgggctcc 216 0 

ggtgcgttcg gcacggtgta taagggactc tggatcccag aaggtgagaa agttaaaatt 222 0 

cccgtcgcta tcaaggaatt aagagaagca acatctccga aagccaacaa ggaaatcctc 2280 

gatgaagcct acgtgatggc cagcgtggac aacccccacg tgtgccgcct gctgggcatc 2340 

tgcctcacct ccaccgtgca actcatcacg cagctcatgc ccttcggctg cctcctggac 2400 

tatgtccggg aacacaaaga caatattggc tcccagtacc tgctcaactg gtgtgtgcag 2460 

10 atcgcaaagg gcatgaacta cttggaggac cgtcgcttgg tgcaccgcga cctggcagcc 2520 

aggaacgtac tggtgaaaac accgcagcat gtcaagatca cagattttgg gctggccaaa 2580 

ctgctgggtg cggaagagaa agaataccat gcagaaggag gcaaagtgcc tatcaagtgg 2640 

atggcattgg aatcaatttt acacagaatc tatacccacc agagtgatgt ctggagctac 2 7 00 

ggggtgaccg tttgggagtt gatgaccttt ggatccaagc catatgacgg aatccctgcc 2760 

agcgagatct cctccatcct ggagaaagga gaacgcctcc ctcagccacc catatgtacc 2820 

atcgatgtct acatgatcat ggtcaagtgc tggatgatag acgcagatag tcgcccaaag 2 8 80 

ttccgtgagt tgatcatcga attctccaaa atggcccgag acccccagcg ctaccttgtc 2940 

attcaggggg atgaaagaat gcatttgcca agtcctacag actccaactt ctaccgtgcc 3000 

ctgatggatg aagaagacat ggacgacgtg gtggatgccg acgagtacct catcccacag 3060 

20 cagggcttct tcagcagccc ctccacgtca cggactcccc tcctgagctc tctgagtgca 3120 

accagcaaca attccaccgt ggcttgcatt gatagaaatg ggctgcaaag ctgtcccatc 3180 

aaggaagaca gcttcttgca gcgatacagc tcagacccca caggcgcctt gactgaggac 324 0 

agcatagacg acaccttcct cccagtgcct gaatacataa accagtccgt tcccaaaagg 3 30 0 

cccgctggct ctgtgcagaa tcctgtctat cacaatcagc ctctgaaccc cgcgcccagc 3360 

siga.gsicccac actaccagga cccccacagc actgcagtgg gcaaccccga gtatctcaac 3420 

actgtccagc ccacctgtgt caacagcaca ttcgacagcc ctgcccactg ggcccagaaa 348 0 

ggcagccacc aaattagcct ggacaaccct gactaccagc aggacttctt tcccaaggaa 3540 

gccaagccaa atggcatctt taagggctcc acagctgaaa atgcagaata cctaagggtc 3600 

gcgccacaaa gcagtgaatt tattggagca tga 3 633 

30 

<210> 2 
<211> 1210 
<212> PRT 
<213> EGFR 
<400> 2 

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
15 10 15 

Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gin 
40 20 25 30 

Gly Thr Ser Asn Lys Leu Thr Gin Leu Gly Thr Phe Glu Asp His Phe 

35 40 45 

Leu Ser Leu Gin Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 

50 55 60 

Leu Glu lie Thr Tyr Val Gin Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65 70 75 80 

Thr He Gin Glu Val Ala Gly Tyr Val Leu He Ala Leu Asn Thr Val 

85 90 95 

Glu Arg He Pro Leu Glu Asn Leu Gin He He Arg Gly Asn Met Tyr 
50 100 105 110 

Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 

115 120 125 

Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gin Glu He Leu 
130 135 140 



His 
145 
Ser 

Ser 

Ser 

Lys 

10 

Gly 
225 
Thr 

Glu 

Thr 

Ala 

20 

Gly 
305 
Asp 

Cys 

Ala 

Leu 

30 

Pro 
385 
He 

Leu 

His 

Gly 

40 

Gly 
465 
Phe 

Asn 

Glu 

Val 

50 

Glu 
545 
Glu 



Gly Ala Val 

He Gin Trp 

Met Asp Phe 
180 

Cys Pro Asn 
195 

Leu Thr Lys 
210 

Lys Ser Pro 

Gly Pro Arg 

Ala Thr Cys 
260 

Thr Tyr Gin 

275 
Thr Cys Val 
290 

Ser Cys Val 

Gly Val Arg 

Asn Gly He 
340 

Thr Asn He 
355 

His He Leu 
370 

Pro Leu Asp 

Thr Gly Phe 

His Ala Phe 
420 

Gly Gin Phe 

435 
Leu Arg Ser 
450 

Asn Lys Asn 

Gly Thr Ser 

Ser Cys Lys 
500 

Gly Cys Trp 

515 
Ser Arg Gly 
530 

Pro Arg Glu 
Cys Leu Pro 



Arg Phe Ser 
150 

Arg Asp He 
165 

Gin Asn His 

Gly Ser Cys 

He He Cys 
215 

Ser Asp Cys 

230 
Glu Ser Asp 
245 

Lys Asp Thr 

Met Asp Val 

Lys Lys Cys 
295 

Arg Ala Cys 

310 
Lys Cys Lys 
325 

Gly He Gly 

Lys His Phe 

Pro Val Ala 
375 

Pro Gin Glu 
390 

Leu Leu He 
405 

Glu Asn Leu 

Ser Leu Ala 

Leu Lys Glu 
455 

Leu Cys Tyr 

470 
Gly Gin Lys 
485 

Ala Thr Gly 

Gly Pro Glu 

Arg Glu Cys 
535 

Phe Val Glu 

550 
Gin Ala Met 
565 
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Asn Asn Pro 

Val Ser Ser 
170 

Leu Gly Ser 

185 
Trp Gly Ala 
200 

Ala Gin Gin 

Cys His Asn 

Cys Leu Val 
250 

Cys Pro Pro 

265 
Asn Pro Glu 
280 

Pro Arg Asn 

Gly Ala Asp 

Lys Cys Glu 
330 

Glu Phe Lys 

345 
Lys Asn Cys 
360 

Phe Arg Gly 

Leu Asp He 

Gin Ala Trp 
410 

Glu He He 

425 
Val Val Ser 
440 

He Ser Asp 

Ala Asn Thr 

Thr Lys He 
490 

Gin Val Cys 

505 
Pro Arg Asp 
520 

Val Asp Lys 

Asn Ser Glu 

Asn He Thr 
570 



Ala Leu Cys 
155 

Asp Phe Leu 

Cys Gin Lys 

Gly Glu Glu 
205 

Cys Ser Gly 

220 
Gin Cys Ala 
235 

Cys Arg Lys 

Leu Met Leu 

Gly Lys Tyr 
285 

Tyr Val Val 

300 
Ser Tyr Glu 
315 

Gly Pro Cys 

Asp Ser Leu 

Thr Ser He 
365 

Asp Ser Phe 

380 
Leu Lys Thr 
395 

Pro Glu Asn 

Arg Gly Arg 

Leu Asn He 
445 

Gly Asp Val 

460 
He Asn Trp 
475 

He Ser Asn 

His Ala Leu 

Cys Val Ser 
525 

Cys Asn Leu 

540 
Cys He Gin 
555 

Cys Thr Gly 



Asn Val Glu 
160 

Ser Asn Met 

175 
Cys Asp Pro 
190 

Asn Cys Gin 

Arg Cys Arg 

Ala Gly Cys 
240 

Phe Arg Asp 

255 
Tyr Asn Pro 
270 

Ser Phe Gly 

Thr Asp His 

Met Glu Glu 
320 

Arg Lys Val 

335 
Ser He Asn 
350 

Ser Gly Asp 

Thr His Thr 

Val Lys Glu 
400 

Arg Thr Asp 

415 
Thr Lys Gin 
430 

Thr Ser Leu 

He He Ser 

Lys Lys Leu 
480 

Arg Gly Glu 

495 
Cys Ser Pro 
510 

Cys Arg Asn 

Leu Glu Gly 

Cys His Pro 
560 

Arg Gly Pro 
575 
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Asp Asn Cys lie Gin Cys Ala His Tyr lie Asp Gly Pro His Cys Val 

580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 

595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 

610 615 620 

Thr Tyr Gly Cys Thr Gly Pro Gly Leu Glu Gly Cys Pro Thr Asn Gly 
625 630 635 640 

Pro Lys lie Pro Ser lie Ala Thr Gly Met Val Gly Ala Leu Leu Leu 
10 645 650 655 

Leu Leu Val Val Ala Leu Gly lie Gly Leu Phe Met Arg Arg Arg His 

660 665 670 

lie Val Arg Lys Arg Thr Leu Arg Arg Leu Leu Gin Glu Arg Glu Leu 

675 680 685 

Val Glu Pro Leu Thr Pro Ser Gly Glu Ala Pro Asn Gin Ala Leu Leu 

690 695 700 

Arg lie Leu Lys Glu Thr Glu Phe Lys Lys lie Lys Val Leu Gly Ser 
705 710 715 720 

Gly Ala Phe Gly Thr Val Tyr Lys Gly Leu Trp lie Pro Glu Gly Glu 
20 725 730 735 

Lys Val Lys He Pro Val Ala He Lys Glu Leu Arg Glu Ala Thr Ser 

740 745 750 

Pro Lys Ala Asn Lys Glu He Leu Asp Glu Ala Tyr Val Met Ala Ser 

755 760 765 

Val Asp Asn Pro His Val Cys Arg Leu Leu Gly He Cys Leu Thr Ser 

770 775 780 

Thr Val Gin Leu He Thr Gin Leu Met Pro Phe Gly Cys Leu Leu Asp 
785 790 795 800 

Tyr Val Arg Glu His Lys Asp Asn He Gly Ser Gin Tyr Leu Leu Asn 
30 805 810 815 

Trp Cys Val Gin He Ala Lys Gly Met Asn Tyr Leu Glu Asp Arg Arg 

820 825 830 

Leu Val His Arg Asp Leu Ala Ala Arg Asn Val Leu Val Lys Thr Pro 

835 840 845 

Gin His Val Lys He Thr Asp Phe Gly Leu Ala Lys Leu Leu Gly Ala 

850 855 860 

Glu Glu Lys Glu Tyr His Ala Glu Gly Gly Lys Val Pro He Lys Trp 
865 870 875 880 

Met Ala Leu Glu Ser He Leu His Arg He Tyr Thr His Gin Ser Asp 
40 885 890 895 

Val Trp Ser Tyr Gly Val Thr Val Trp Glu Leu Met Thr Phe Gly Ser 

900 905 910 

Lys Pro Tyr Asp Gly He Pro Ala Ser Glu He Ser Ser He Leu Glu 

915 920 925 

Lys Gly Glu Arg Leu Pro Gin Pro Pro He Cys Thr He Asp Val Tyr 

930 935 940 

Met He Met Val Lys Cys Trp Met He Asp Ala Asp Ser Arg Pro Lys 
945 950 955 960 

Phe Arg Glu Leu He He Glu Phe Ser Lys Met Ala Arg Asp Pro Gin 
50 965 970 975 

Arg Tyr Leu Val He Gin Gly Asp Glu Arg Met His Leu Pro Ser Pro 

980 985 990 

Thr Asp Ser Asn Phe Tyr Arg Ala Leu Met Asp Glu Glu Asp Met Asp 
995 1000 1005 
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Asp Val Val Asp Ala Asp Glu Tyr Leu lie Pro Gin Gin Gly Phe 

1010 1015 1020 

Phe Ser Ser Pro Ser Thr Ser Arg Thr Pro Leu Leu Ser Ser Leu 

1025 1030 1035 

Ser Ala Thr Ser Asn Asn Ser Thr Val Ala Cys lie Asp Arg Asn 

1040 1045 1050 

Gly Leu Gin Ser Cys Pro lie Lys Glu Asp Ser Phe Leu Gin Arg 

1055 1060 1065 

Tyr Ser Ser Asp Pro Thr Gly Ala Leu Thr Glu Asp Ser lie Asp 
10 1070 1075 1080 

Asp Thr Phe Leu Pro Val Pro Glu Tyr lie Asn Gin Ser Val Pro 

1085 1090 1095 

Lys Arg Pro Ala Gly Ser Val Gin Asn Pro Val Tyr His Asn Gin 

1100 1105 1110 

Pro Leu Asn Pro Ala Pro Ser Arg Asp Pro His Tyr Gin Asp Pro 

1115 1120 1125 

His Ser Thr Ala Val Gly Asn Pro Glu Tyr Leu Asn Thr Val Gin 

1130 1135 1140 

Pro Thr Cys Val Asn Ser Thr Phe Asp Ser Pro Ala His Trp Ala 
20 1145 1150 1155 

Gin Lys Gly Ser His Gin lie Ser Leu Asp Asn Pro Asp Tyr Gin 

1160 1165 1170 

Gin Asp Phe Phe Pro Lys Glu Ala Lys Pro Asn Gly lie Phe Lys 

1175 1180 1185 

Gly Ser Thr Ala Glu Asn Ala Glu Tyr Leu Arg Val Ala Pro Gin 

1190 1195 1200 

Ser Ser Glu Phe lie Gly Ala 

1205 1210 
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<210> 3 
<211> 31 
<212> DNA 
<213> primer 
<400> 3 

ctaggctagc atgcgaccct ccgggacggc c 31 



<210> 4 
40 <211> 42 

<212> DNA 
<213> primer 
<400> 4 

ctctgcccgg cgactcgggc tgacagctat gagatggagg aa 42 



<210> 5 
<211> 39 
<212> DNA 
50 <213> primer 

<400> 5 

ttcctccatc tcatagctag cccgactcgc cgggcagag 39 
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<210> 6 
<211> 31 
<212> DNA 
<213> primer 
<400> 6 

ggatatccta cgtgcgcttc cgaacgatgt g 31 



<210> 7 
10 <211> 2607 

<212> DNA 
<213> MuSK 
<400> 7 

atgagagagc tcgtcaacat tccactggta 
actgagaaac ttccaaaagc tcctgtcatc 
gttgaagaag tggctacttt catgtgtgca 
tggactagaa ataaaattct cattaaactc 
gggcagctcc tcaccatcct gagtgtggaa 
gccaacaatg gtgtgggagg agctgtggag 

20 cctaaaataa ctcgtcctcc cataaatgtg 

ccatgtacta caatgggtaa tcccaaacca 
ctcagggaaa attcccgaat tgcagttctt 
caaaaggaag atgcaggaca gtatcgatgt 
tccaaagtgg tgaagctgga atttgaggtt 
cacaatgtca cctttggctc ctttgtgacc 
cccaccatca cctggattga aaacggaaat 
gtgaaagacc gagtgattga ctcaagactg 
acatgcatag ctaccaataa gcatggggag 
atcagcatag cagaatggag taaaccacag 

30 agaggggagg tgtgtaatgc agtcctggca 

tatgcggacc ctgaggaggc ccaagagcta 
gtagtgagcc cagtctgccg gccagctgct 
gagtgcagtc ctggagtagt gcctactcct 
gtaaaggagc tcttctgcgc aaaagaatgg 
ctctacagat ccgagatgca tttgctgtcc 
cattgggacc ccacggcctg tgccagactg 
aaaacattcc caccaatgac gtcctcaaag 
tcctcctctt cttccttctc tgtctcacct 
atgtccagct ttgcaatatt tgtgcttctt 

40 agaaaacaat ggaaaaataa gaaaagagaa 

tctgagctct tactagatag acttcatccc 
ctgaacccca aattgctcag cctggagtat 
atcggagagg gagcgtttgg aagggtgttt 
gaacctttca ctatggtggc agtaaagatg 
gcggactttc agagggaggc agccctcatg 
ctattaggag tgtgtgctgt cgggaagcca 
ggtgacctca atgagttcct ccgcagcatg 
agtgacttgt ctatgagggc tcaggtctcc 
gagcagcttt gcattgccag gcaggtggca 

50 tttgttcacc gagatttagc caccaggaac 

attgccgact ttggcctctc caggaacatc 
aacgacgcta tccctatccg ttggatgcca 
acagagtctg atgtgtgggc ctatggcgtg 
cagccctact atgggatggc ccatgaggag 



catattctta ctctggttgc cttcagcgga 60 

accactcctc ttgaaacagt ggatgcctta 120 

gtggaatcct acccccagcc tgagatttcc 180 

tttgacaccc ggtacagcat ccgggagaat 240 

gacagtgatg atggcattta ctgctgcacg 3 00 

agttgtggag ccctgcaagt gaagatgaaa 3 60 

aaaataatag agggattaaa agcagtccta 42 0 

tcagtgtctt ggataaaggg agacagccct 480 

gaatctggga gcttgaggat tcataacgta 540 

gtggcaaaaa acagcctcgg gacagcatat 6 00 

tttgccagga tcctgcgggc tcctgaatcc 660 

ctgcactgta cagcaacagg cattcctgtc 72 0 

gctgtttctt ctgggtccat tcaagagagt 7 80 

cagctgttta tcaccaagcc aggactctac 840 

aagttcagta ctgccaaggc tgcagccacc 900 

aaagataaca aaggctactg cgcccagtac 960 

aaagatgctc ttgtttttct caacacctcc 1020 

ctggtccaca cggcctggaa tgaactgaaa 1080 

gaggctttgt tgtgtaacca catcttccag 1140 

attcccattt gcagagagta ctgcttggca 1200 

ctggtaatgg aagagaagac ccacagagga 1260 

gtgccaaaat gcagcaagct tcccagcatg 1320 

ccacatctag attataacaa agaaaaccta 1380 

ccaagtgtgg acattccaaa tctgccttcc 1440 

acatactcca tgactgtaat aatctccatc 1500 

accataacta ctctctattg ctgccgaaga 1560 

tcagcagcag taaccctcac cacactgcct 1620 

aaccccatgt accagaggat gccgctcctt 1680 

ccaaggaata acattgaata tgtgagagac 174 0 

caagcaaggg caccaggctt acttccctat 1800 

ctcaaagaag aagcctcggc agatatgcaa 1860 

gcagaatttg acaaccctaa cattgtgaag 1920 

atgtgcctgc tctttgaata catggcctat 1980 

tcccctcaca ccgtgtgcag cctcagtcac 2040 

agccctgggc ccccacccct ctcctgtgct 2100 

gctggcatgg cttacctctc agaacgtaag 2160 

tgcctggtgg gcgagaacat ggtggtgaaa 2220 

tactcagcag actactacaa agctaatgaa 22 8 0 

ccagagtcca ttttttataa ccgctacact 2340 

gtcctctggg agatcttctc ctatggcctg 2400 

gtcatttact acgtgcgaga tggcaacatc 2460 
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ctctcctgcc ctgagaactg ccccgtggag ctgtacaatc tcatgcgtct atgttggagc 2520 
aagctgcctg cagacagacc cagtttcacc agtattcacc gaattctgga acgcatgtgt 2580 
gagagggcag agggaactgt gagtgtc 2607 



<210> 8 
<211> 869 
<212> PRT 
<213> MuSK 
10 <400> 8 

Met Arg Glu Leu Val Asn lie Pro Leu Val His lie Leu Thr Leu Val 

15 10 15 

Ala Phe Ser Gly Thr Glu Lys Leu Pro Lys Ala Pro Val He Thr Thr 

20 25 30 

Pro Leu Glu Thr Val Asp Ala Leu Val Glu Glu Val Ala Thr Phe Met 

35 40 45 

Cys Ala Val Glu Ser Tyr Pro Gin Pro Glu He Ser Trp Thr Arg Asn 

50 55 60 

Lys He Leu lie Lys Leu Phe Asp Thr Arg Tyr Ser He Arg Glu Asn 
20 65 70 75 80 

Gly Gin Leu Leu Thr He Leu Ser Val Glu Asp Ser Asp Asp Gly He 

85 90 95 

Tyr Cys Cys Thr Ala Asn Asn Gly Val Gly Gly Ala Val Glu Ser Cys 

100 105 110 

Gly Ala Leu Gin Val Lys Met Lys Pro Lys He Thr Arg Pro Pro He 

115 120 125 

Asn Val Lys He He Glu Gly Leu Lys Ala Val Leu Pro Cys Thr Thr 

130 135 140 

Met Gly Asn Pro Lys Pro Ser Val Ser Trp He Lys Gly Asp Ser Pro 
30 145 150 155 160 

Leu Arg Glu Asn Ser Arg He Ala Val Leu Glu Ser Gly Ser Leu Arg 

165 170 175 

He His Asn Val Gin Lys Glu Asp Ala Gly Gin Tyr Arg Cys Val Ala 

180 185 190 

Lys Asn Ser Leu Gly Thr Ala Tyr Ser Lys Val Val Lys Leu Glu Phe 

195 200 205 

Glu Val Phe Ala Arg He Leu Arg Ala Pro Glu Ser His Asn Val Thr 

210 215 220 

Phe Gly Ser Phe Val Thr Leu His Cys Thr Ala Thr Gly He Pro Val 
40 225 230 235 240 

Pro Thr He Thr Trp He Glu Asn Gly Asn Ala Val Ser Ser Gly Ser 

245 250 255 

He Gin Glu Ser Val Lys Asp Arg Val He Asp Ser Arg Leu Gin Leu 

260 265 270 

Phe He Thr Lys Pro Gly Leu Tyr Thr Cys He Ala Thr Asn Lys His 

275 280 285 

Gly Glu Lys Phe Ser Thr Ala Lys Ala Ala Ala Thr He Ser He Ala 

290 295 300 

Glu Trp Ser Lys Pro Gin Lys Asp Asn Lys Gly Tyr Cys Ala Gin Tyr 
50 305 310 315 320 

Arg Gly Glu Val Cys Asn Ala Val Leu Ala Lys Asp Ala Leu Val Phe 

325 330 335 

Leu Asn Thr Ser Tyr Ala Asp Pro Glu Glu Ala Gin Glu Leu Leu Val 
340 345 350 



His 

Ala 

Gly 
385 
Val 

Thr 

10 

Lys 

Arg 

Pro 
465 
Ser 

He 

20 

Thr 

Arg 

Leu 
545 
Leu 

Tyr 

30 

Arg 

Lys 

Arg 
625 
Leu 

Tyr 

40 

His 

Val 

He 
705 
Phe 

Met 

50 

Ala 
Met 



Thr Ala Trp 
355 

Ala Glu Ala 
370 

Val Val Pro 

Lys Glu Leu 

His Arg Gly 
420 

Cys Ser Lys 

435 
Leu Pro His 
450 

Met Thr Ser 

Ser Ser Ser 

He Ser He 
500 

Thr Leu Tyr 

515 
Glu Ser Ala 
530 

Asp Arg Leu 

Asn Pro Lys 

Val Arg Asp 
580 

Ala Pro Gly 

595 
Met Leu Lys 
610 

Glu Ala Ala 

Leu Gly Val 

Met Ala Tyr 
660 

Thr Val Cys 

675 
Ser Ser Pro 
690 

Ala Arg Gin 

Val His Arg 

Val Val Lys 
740 

Asp Tyr Tyr 

755 
Pro Pro Glu 
770 



Asn Glu Leu 

Leu Leu Cys 
375 

Thr Pro He 

390 
Phe Cys Ala 
405 

Leu Tyr Arg 

Leu Pro Ser 

Leu Asp Tyr 
455 

Ser Lys Pro 

470 
Ser Phe Ser 
485 

Met Ser Ser 

Cys Cys Arg 

Ala Val Thr 
535 

His Pro Asn 
550 

Leu Leu Ser 
565 

He Gly Glu 

Leu Leu Pro 

Glu Glu Ala 
615 

Leu Met Ala 

630 
Cys Ala Val 
645 

Gly Asp Leu 

Ser Leu Ser 

Gly Pro Pro 
695 

Val Ala Ala 
710 

Asp Leu Ala 
725 

He Ala Asp 

Lys Ala Asn 

Ser He Phe 
775 
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Lys Val Val 
360 

Asn His He 

Pro He Cys 

Lys Glu Trp 
410 

Ser Glu Met 

425 
Met His Trp 
440 

Asn Lys Glu 

Ser Val Asp 

Val Ser Pro 
490 

Phe Ala He 
505 

Arg Arg Lys 
520 

Leu Thr Thr 

Pro Met Tyr 

Leu Glu Tyr 
570 

Gly Ala Phe 
585 

Tyr Glu Pro 
600 

Ser Ala Asp 

Glu Phe Asp 

Gly Lys Pro 
650 

Asn Glu Phe 
665 

His Ser Asp 
680 

Pro Leu Ser 

Gly Met Ala 

Thr Arg Asn 
730 

Phe Gly Leu 

745 
Glu Asn Asp 
760 

Tyr Asn Arg 



Ser Pro Val 
365 

Phe Gin Glu 
380 

Arg Glu Tyr 
395 

Leu Val Met 

His Leu Leu 

Asp Pro Thr 
445 

Asn Leu Lys 

460 
He Pro Asn 
475 

Thr Tyr Ser 

Phe Val Leu 

Gin Trp Lys 
525 

Leu Pro Ser 

540 
Gin Arg Met 
555 

Pro Arg Asn 

Gly Arg Val 

Phe Thr Met 
605 

Met Gin Ala 
620 

Asn Pro Asn 
635 

Met Cys Leu 

Leu Arg Ser 

Leu Ser Met 
685 

Cys Ala Glu 

700 
Tyr Leu Ser 
715 

Cys Leu Val 

Ser Arg Asn 

Ala He Pro 
765 

Tyr Thr Thr 
780 



Cys Arg Pro 

Cys Ser Pro 

Cys Leu Ala 
400 

Glu Glu Lys 

415 
Ser Val Pro 
430 

Ala Cys Ala 

Thr Phe Pro 

Leu Pro Ser 
480 

Met Thr Val 

495 
Leu Thr He 
510 

Asn Lys Lys 

Glu Leu Leu 

Pro Leu Leu 
560 

Asn He Glu 

575 
Phe Gin Ala 
590 

Val Ala Val 

Asp Phe Gin 

He Val Lys 
640 

Leu Phe Glu 
655 

Met Ser Pro 
670 

Arg Ala Gin 

Gin Leu Cys 

Glu Arg Lys 
720 

Gly Glu Asn 

735 
He Tyr Ser 
750 

He Arg Trp 
Glu Ser Asp 
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Val Trp Ala Tyr 
785 

Gin Pro Tyr Tyr 

Asp Gly Asn lie 
820 

Asn Leu Met Arg 
835 

Phe Thr Ser lie 
10 850 

Gly Thr Val Ser 
865 



Gly Val Val Leu 
790 

Gly Met Ala His 
805 

Leu Ser Cys Pro 

Leu Cys Trp Ser 
840 

His Arg lie Leu 
855 

Val 



Trp Glu lie Phe 
795 

Glu Glu Val He 
810 

Glu Asn Cys Pro 
825 

Lys Leu Pro Ala 

Glu Arg Met Cys 
860 



Ser Tyr Gly Leu 
800 

Tyr Tyr Val Arg 
815 

Val Glu Leu Tyr 
830 

Asp Arg Pro Ser 
845 

Glu Arg Ala Glu 



<210> 9 

<211> 25 

<212> DNA 

<213> primer 

<400> 9 



20 cgtcctgcgt gagcctggat taatc 2 5 
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<210> 10 
<211> 25 
<212> DMA 
<213> primer 
<400> 10 

gcctggatta atcatgagag agctc 25 



<210> 11 
<211> 42 
<212> DNA 
<213> primer 
<400> 11 

cgaggcctgt cttcaacctt agacactcac agttccctct gc 42 



<210> 12 
40 <211> 27 

<212> DNA 
<213> primer 
<400> 12 

cggcctgtgc cagactgcca catctag 2 7 



<210> 13 
<211> 32 
<212> DNA 
50 <213> primer 

<400> 13 

cgtctaggtg agggttactg ctgctgattc tc 32 
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<210> 14 
<211> 38 
<212> DNA 
<213> primer 
<400> 14 

ggttaaccct attcaatgtt attccttgga tactccag 38 
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<210> 15 
<211> 24 
<212> DNA 



<213> primer 
<400> 15 

cttccaaaag ctcctgtcat cacc 
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<210> 16 
<211> 26 
<212> DNA 



<213> primer 
<400> 16 

ccagtcatgg agtatgtagg tgagac 



26 
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IT IS CLAIMED: 



1 . A method of identifying genetically modified mammalian cells comprising the steps of: 

a) introducing a nucleic acid sequence encoding a mutated protein-tyrosine kinase receptor 
(PTKR) operatively linked to an expression control sequence into a cell to form a 
genetically modified cell, wherein the mutated PTKR either comprises a modification to 
the intracellular and the extracellular domains, or excludes any nerve growth factor 
receptor(s) (NGFR) and comprises a modification to the intracellular domain; 

b) allowing expression of the mutated PTKR in the genetically modified cell; and 
1 0 c) identifying said genetically modified cell expressing the mutated PTKR. 

2. The method according to claim 1, wherein the mutated PTKR is selected fi-om a mutated 
epidermal growth factor receptor (EGFR) family member and muscle specific tyrosine kinase 
receptor (MuSK-R) family member. 

3. The method according to claim 2, wherein the mutated EGFR family member is a 
mutated EGFRl, optionally selected from the sequence designated EGFRl-I and EGFRl-H. 

4. The method according to claim 2, wherein the mutated MuSK-R is selected from the 
20 sequence designated mMuSK-RI and mMuSK-RIL 

5. The method according to any preceding claims, wherein the mutated PTKR is truncated 
from the intracellular domain, and optionally also from the extracellular domain. 

6. The method according to claim 2, wherein the introducing step is accomplished by 
incorporating the nucleic acid sequence encoding the mutated EGFR or MuSK-R into a vector 
and introducing said vector into said cell. 



7. The method according to claim 6, wherein the vector is a retroviral vector. 

30 
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8. The method according to claim 1, wherein said identifying step is accomplished by 
contacting the genetically modified cells with an antibody that recognizes and binds to the 
mutated PTKR. 

9. The method according to claim 1, wherein the identifying step separates the genetically 
modified cells from the non-genetically modified cells. 

10. The method according to claim 1, further comprising the step of separating the identified 
cells expressing the mutated PTKR. 

1 1 . The method according to claim 1 , wherein the cells are himian cells. 

12. The method according to claim 11, wherein the cells are selected firom the group 
consisting of hematopoietic cells, hver cells, endotheUal cells and smooth muscle cells. 

13. The method according to claim 1 1 , wherein the cells are hematopoietic cells. 

14. The method according to claim 13, wherein the cells are hematopoietic cells are stem 
cells or T-cells. 

15. The method according to claim 6, wherein a heterologous gene is also incorporated into 
said vector. 

16. A method of identifying genetically modified mammaUan cells comprising the steps of: 

a) incorporating into a vector a nucleic acid sequence encoding a mutated protein- 
tyrosine kinase receptor (PTKR) family member, wherein said PTKR either 
comprises a modification to the intracellular and the extracellular domains, or 
excludes any nerve growth factor receptor(s) (NGFR) and comprises a modification 
to the intracellular domain; introducing the vector into a mammalian cell to form a 
genetically modified cell; 

b) allowing expression of the mutated PTKR in the genetically modified cell; and 



53 

c) identifying said genetically modified cell expressing the mutated PTKR. 

17. The method according to claim 16, wherein the mutated PTKR is selected from EGFR, 
preferably EGFRl-I or EGFRl-II, and MuSK-R, preferably mMuSK-RI or mMuSK-RH. 

1 8. The method according to claim 1 6, wherein the vector is a retroviral vector. 

19. The method according to claim 16, wherein a heterologous gene is also incorporated into 
said vector. 

20. A method for the inrmunoselection of transduced mammalian cells comprising, 

a) retrovirally transducing mammalian cells with a nucleic acid sequence encoding a 
protein-tyrosine kinase receptor (PTfCR) family member operatively hnked to an 
expression control sequence; wherein said PTKR either comprises a modification to 
the intracellular and the extracellular domains, or excludes any nerve growth factor 
receptor(s) (NGFR) and comprises a modification to the intracellular domain; 

b) incubating the transduced cells with a marked antibody which recognizes and binds 
specifically to the mutated PTKR; and 

c) identifying the marked transduced cells. 

21. The method according to claim 20, wherein the cells are human cells or hematopoietic 
cells. 

22. The method according to claim 20, wherein the mutated PTKR is selected from EGFR, 
preferably EGFRl-I or EGFRl-II, and MuSK-R, preferably mMuSK-RI or mMuSK-RH. 

23. The method according to claim 20, wherein the cells are transduced by a retroviral 
vector derived from the group consisting of moloney murine leukemia virus (MoMLV), 
myeloproUferative sarcoma virus (MPSV), murine embryonic stem cell vims (MESV), 
murine stem cell virus (MSCV) and spleen focus forming virus (SFFV). 
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24. The method according to claim 20, wherein the cells are transduced by a vector derived 
from a lentivirus. 

25. The method according to claim 20, further comprising the step of separating the 
identified marked transduced cells from non-marked cells. 

26. The method according to claim 20, fiirther comprising the step of expanding the marked 
transduced cells. 

10 27. A method of identifying mammalian cells expressing a protein of interest, comprising 
the steps of, 

a) introducing into a mammalian cell a nucleic acid encoding a nucleic acid comprising 
a DNA sequence encoding a protein of interest and encoding a protein-tyrosine 
kinase receptor (PTKR) family member operatively linked to an expression control 
sequence, wherein said PTKR either comprises a modification to the intracellular 
and the extracellular domains, or excludes any nerve growth factor receptor(s) 
(NGFR) and comprises a modification to the intracellular domain; 

b) cultimng the resulting mammalian cells; and 

c) identifying cells which express the mutated PTKR thereby obtaining cells which 
express the protein of interest. 

The method according to claim 27, wherein the mutated PTKR is selected from EGFR, 
preferably EGFRl-I or EGFRl-II, and MuSK-R, preferably mMuSK-RI or mMuSK-RH. 

The method according to claim 27, wherein the nucleic acid encoding the mutated PTKR 
and the nucleic acid encoding the protein of interest in step (a) are introduced on a 
retroviral vector. 

A mutated muscle specific tyrosine kinase receptor (MuSK-R) family member which is 
truncated within the cytoplamic domain, preferably mMuSK-RI or mMuSK-RH. 



20 

28. 
29. 

30. 

30 
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ABSTRACT 



The present invention concerns a method of identifying genetically modified mammalian cells 
using a mutated protein-tyrosine kinase receptor (PTKR) as a selectable marker in mammalian 
cells. Particularly preferred mutated PTKR selective markers are mutated epidermal growth 
factor receptor (EGFR) family members, and muscle specific tyrosine kinase receptor (MuSK- 
R) family members. Further a method for the immunoselection of transduced mammahan cells 
is disclosed comprising retrovirally transducing mammalian cells with a nucleic acid sequence 
encoding a mutated EGFR, incubating the transduced cells with a marked antibody which 
recognizes and binds specifically to the mutated PTKR, and identifying the marked transduced 
cells. 



signal cysteine-rich 
sequence domains 



e 

M 

g ^tyrosine 
kinase 
domain 



A.) 



f^wmzzzz 



V 



72L 



N BL///l || l 



HP IP BP 
Ul U UL 



C EGFRwt 



mutated 
EGFR-il 

i»79-1210 



Ig-BKe 
domain 



tyrosine 
Ifinase 
domain 



B.) 



N 



N 




C wtlVIuSK 




C mutated MuSK 



A587-8G9 



In A.) ttie the wild-fype and tiie mutated EGFR-II are shown, in B.) the 
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CCGGGACGGL 
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2900 


ATTCTCCAAA 


ATGGCCCGAG 


ACCCCCAGCG 


CTACCTTGTC 


ATTCAGGGGG 


2950 


ATGAAAGAAT 


GCATTTGCCA 


AGTCCTACAG 


ACTCCAACTT 


CTACCGTGCC 


3000 


CTGATGGATG 


AAGAAGACAT 


GGACGACGTG 


GTGGATGCCG 


ACGAGTACCT 


3050 


CATCCCACAG 


CAGGGCTTCT 


TCAGCAGCCC 


CTCCACGTCA 


CGGACTCCCC 


3100 


TCCTGAGCTC 


TCTGAGTGCA 


ACCAGCAACA 


ATTCCACCGT 


GGCTTGCATT 


3150 


GATAGAAATG 


GGCTGCAAAG 


CTGTCCCATC 


AAGGAAGACA 


GCTTCTTGCA 


3200 


GCGATACAGC 


TCAGACCCCA 


CAGGCGCCTT 


GACTGAGGAC 


AGCATAGACG 


3250 


ACACCTTCCT 


CCCAGTGCCT 


GAATACATAA 


ACCAGTCCGT 


TCCCAAAAGG 


3300 


CCCGCTGGCT 


CTGTGCAGAA 


TCCTGTCTAT 


CACAATCAGC 


CTCTGAACCC 


3350 


CGCGCCCAGC 


AGAGACCCAC 


ACTACCAGGA 


CCCCCACAGC 


ACTGCAGTGG 


3400 


GCAACCCCGA 


GTATCTCAAC 


ACTGTCCAGC 


CCACCTGTGT 


CAACAGCACA 


3450 


TTCGACAGCC 


CTGCCCACTG 


GGCCCAGAAA 


GGCAGCCACC 


AAATTAGCCT 


3500 


GGACAACCCT 


GACTACCAGC 


AGGACTTCTT 


TCCCAAGGAA 


GCCAAGCCAA 


3550 


ATGGCATCTT 


TAAGGGCTCC 


ACAGCTGAAA 


ATGCAGAATA 


CCTAAGGGTC 


3600 


GCGCCACAAA 


GCAGTGAATT 


TATTGGAGCA 


TGA 




3630 
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MRELVNIPLV HILTLVAFSG TEKLPKAPVI TTPLETVDAL VEEVATFMCA 50 

VESYPQPEIS WTRNKILIKL FDTRYSIREN GQLLTILSVE DSDDGIYCCT 100 

ANNGVGGAVE SCGALQVKMK PKITRPPINV KIIEGLKAVL PCTTMGNPKP 150 

5 SVSWIKGDSP LRENSRIAVL ESGSLRIHNV QKEDAGQYRC VAKNSLGTAY 200 

SKWKLEFEV FARILRAPES HNVTFGSFVT LHCTATGIPV PTITWIENGN 250 

AVSSGSIQES VKDRVIDSRL QLFITKPGLY TCIATNKHGE KFSTAKAAAT 300 

ISIAEWSKPQ KDNKGYCAQY RGEVCNAVLA KDALVFLOTS YADPEEAQEL 350 

LVHTAWNELK WSPVCRPAA EALLCNHIFQ ECSPGWPTP IPICREYCLA 400 

10 VKELFCAKEW LVMEEKTHRG LYRSEMHLLS VPKCSKLPSM HWDPTACARL 450 

PHLDYNKENL KTFPPMTSSK PSVDIPNLPS SSSSSFSVSP TYSMTVIISI 500 

MSSFAIFVLL TITTLYCCRR RKQWKNKKRE SAAVTLTTLP SELLLDRLHP 550 

NPMYQRMPLL LNPKLLSLEY PRNNIEYVRD IGEGAFGRVF QARAPGLLPY 600 

EPFTMVAVKM LKEEASADMQ ADFQREAALM AEFDNPNIVK LLGVCAVGKP 650 

15 MCLLFEYMAY GDLNEFLRSM SPHTVCSLSH SDLSMRAQVS SPGPPPLSCA 7 00 

EQLCIARQVA AGMAYLSERK FVHRDLATRN CLVGENMWK lADFGLSRWI 750 

YSADYYKANE NDAIPIRWMP PESIFYNRYT TESDVWAYGV VLWEIFSYGL 800 

QPYYGMAHEE VIYYVRDGNI LSCPENCPVE LYNLMRLCWS KLPADRPSFT 850 

SIHRILERMC ERAEGTVSV 869 
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Case 4-31192/CIP 

DECLARATION AND POWER OF ATTORNEY FOR U.S. PATENT APPLICATIONS 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

and 

I believe I am an original, first and joint inventor of the subject matter which is claimed 
and for which a patent is sought on the invention entitled 

Selectable Cell Surface Marker Genes 
the specification of which is attached hereto, 

1 hereby state that 1 have reviewed and understand the contents of the above identified 
specification, including the claims. 

I acknowledge my duty to disclose all information which is known by me to be material to 
the patentability of this application as defined in 37 C.F.R. §1.56. 

I hereby claim the benefit under 35 U.S.C. §1 19(a)-(d) or §365(b) of any foreign 
application(s) for patent or inventor's certificate listed below and under 35 U.S.C. §365(a) of any 
PCT international application(s) designating at least one country other than the United States 
listed below and have also listed below any foreign application(s) for patent or inventor's 
certificate or any PCT international application(s) designating at least one country other than the 
United States for the same subject matter and having a filing date before that of the application 
the priority of which is claimed for that subject matter: 

None 

I hereby claim the benefit under 35 USC §11 9(e) of any United States provisional 
application(s) identified below: 

Application No. Filing Date 

1 . Provisional Application No. 60/1 66,594 

For "Selectable Cell Surface Marker Genes November 19, 1999 

2. Provisional Application No. to be assigned November 19, 1999 

(filed as Application No. 09/444,038 for 
"Selective Marker Genes," and subject 



to a Petition for Conversion to Provisional 
Application, filed November 16, 2000) 
3. Provisional Application No. to be assigned March 30, 2000 

(filed as Application No. 09/539,248 for 
"Selectable Marker Genes," and subject 
to a Petition for Conversion to 
Provisional Application, filed November 
16, 2000). 

I hereby claim the benefit under 35 U.S.C. §120 of any United States application(s) listed 
below and under 35 U.S.C. §365(c) of any PCT international application(s) designating the 
United States listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in said prior application(s) in the manner required by the first 
paragraph of 35 U.S.C. §1 12, 1 acknowledge the duty to disclose all information known by me to 
be material to patentability as defined in 37 C.F.R. §1 .56 which became available between the 
filing date(s) of the prior application(s) and the national or PCT international filing date of this 
application: 

None 

I hereby appoint the attorneys and agents associated with Customer No. 001095, 
respectively and individually, as my attorneys and agents, with full power of substitution and 
revocation, to prosecute this application and to transact all business in the Patent and 
Trademark Office connected therewith. 

Please address all communications to the address associated with Customer No. 
001095, which is currently Thomas Hoxie, Novartis Corporation, Patent and Trademark Dept., 
564 Morris Avenue, Summit, NJ 07901-1027. 

1 hereby declare that all statements made herein of my own knowledge are true and that 
all statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under 18 U.S.C. §1001 and that such willful false 
statements may jeopardize the validity of the application or any patent issued thereon. 
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FIRST JOINT INVENTOR: 

Full name Susanne Dagmar Pippig 

Signature : 

Date : 

(MM/DD/YY) 

Citizenship : Germany 

Residence : San Francisco, California 

P.O. Address : 1495 7th Avenue 

Apartment 2 

San Francisco, California 94122 

SECOND JOINT INVENTOR: 

Full name Gabor Veres 

Signature : ^ 

Date : 

(MM/DDA^) 

Citizenship : Hungary 

Residence : Palo Alto, California 

P.O. Address : 112 Walter Hays Drive 

Palo Alto, California 94303 

IMPORTANT: Before this declaration is signed, the patent application (the specification, the 
claims and this declaration) must be read and understood by each person signing it, and no 
changes may be made in the application after this declaration has been signed. 
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31192prj .ST25.txt 
SEQUENCE LISTING 



<110> NOVARTIS AG 
VERES, GABOR 
PIPPIG, SUSANNE 

<120> selectable cell surface marker genes 

<130> 4-31192 

<150> us 60/166594 
<151> 1999-11-19 

<150> us 09/539248 
<151> 2000-03-30 

<160> 16 

<170> Patentin version 3.0 

<210> 1 

<211> 3633 

<212> DNA 

<213> EGFR 

<400> 1 

atgcgaccct ccgggacggc cggggcagcg ctcctggcgc tgctggctgc gctctgcccg 
60 

gcgagtcggg ctctggagga aaagaaagtt tgccaaggca cgagtaacaa gctcacgcag 1 
20 

ttgggcactt ttgaagatca ttttctcagc ctccagagga tgttcaataa ctgtgaggtg 1 
80 

gtccttggga atttggaaat tacctatgtg cagaggaatt atgatctttc cttcttaaag 2 
40 



accatccagg aggtggctgg ttatgtcctc attgccctca acacagtgga gcgaattcct 3 
00 



ttggaaaacc tgcagatcat cagaggaaat atgtactacg aaaattccta tgccttagca 3 
60 



gtcttatcta actatgatgc aaataaaacc ggactgaagg agctgcccat gagaaattta 4 
20 




caggaaatcc tgcatggcgc cgtgcggttc agcaacaacc ctgccctgtg caacgtggag 4 
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31192prj .ST25.txt 

80 



agcatccagt ggcgggacat agtcagcagt gactttctca gcaacatgtc gatggacttc 5 
40 



cagaaccacc tgggcagctg ccaaaagtgt gatccaagct gtcccaatgg gagctgctgg 6 
GO 



ggtgcaggag aggagaactg ccagaaactg accaaaatca tctgtgccca gcagtgctcc 6 
60 



gggcgctgcc gtggcaagtc ccccagtgac tgctgccaca accagtgtgc tgcaggctgc 7 
20 



acaggccccc gggagagcga ctgcctggtc tgccgcaaat tccgagacga agccacgtgc 7 
80 

aaggacacct gccccccact catgctctac aaccccacca cgtaccagat ggatgtgaac 8 
40 

cccgagggca aatacagctt tggtgccacc tgcgtgaaga agtgtccccg taattatgtg 9 
00 

gtgacagatc acggctcgtg cgtccgagcc tgtggggccg acagctatga gatggaggaa 9 
60 



gacggcgtcc gcaagtgtaa gaagtgcgaa gggccttgcc gcaaagtgtg taacggaata 10 
20 



ggtattggtg aatttaaaga ctcactctcc ataaatgcta cgaatattaa acacttcaaa 10 
80 

aactgcacct ccatcagtgg cgatctccac atcctgccgg tggcatttag gggtgactcc 11 
40 



ttcacacata ctcctcctct ggatccacag gaactggata ttctgaaaac cgtaaaggaa 12 
00 



atcacagggt ttttgctgat tcaggcttgg cctgaaaaca ggacggacct ccatgccttt 12 
60 



gagaacctag aaatcatacg cggcaggacc aagcaacatg gtcagttttc tcttgcagtc 13 
20 



gtcagcctga acataacatc cttgggatta cgctccctca aggagataag tgatggagat 13 
80 



gtgataattt caggaaacaa aaatttgtgc tatgcaaata caataaactg gaaaaaactg 14 
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40 

tttgggacct ccggtcagaa aaccaaaatt ataagcaaca gaggtgaaaa cagctgcaag 15 
00 

gccacaggcc aggtctgcca tgccttgtgc tcccccgagg gctgctgggg cccggagccc 15 
60 

agggactgcg tctcttgccg gaatgtcagc cgaggcaggg aatgcgtgga caagtgcaag 16 
20 

cttctggagg gtgagccaag ggagtttgtg gagaactctg agtgcataca gtgccaccca 16 
80 

gagtgcctgc ctcaggccat gaacatcacc tgcacaggac ggggaccaga caactgtatc 17 
40 

cagtgtgccc actacattga cggcccccac tgcgtcaaga cctgcccggc aggagtcatg 18 
00 

ggagaaaaca acaccctggt ctggaagtac gcagacgccg gccatgtgtg ccacctgtgc 18 
60 

catccaaact gcacctacgg atgcactggg ccaggtcttg aaggctgtcc aacgaatggg 19 
20 

cctaagatcc cgtccatcgc cactgggatg gtgggggccc tcctcttgct gctggtggtg 19 
80 

gccctgggga tcggcctctt catgcgaagg cgccacatcg ttcggaagcg cacgctgcgg 20 
40 

aggctgctgc aggagaggga gcttgtggag cctcttacac ccagtggaga agctcccaac 21 
00 

caagctctct tgaggatctt gaaggaaact gaattcaaaa agatcaaagt gctgggctcc 21 
60 

ggtgcgttcg gcacggtgta taagggactc tggatcccag aaggtgagaa agttaaaatt 22 
20 

cccgtcgcta tcaaggaatt aagagaagca acatctccga aagccaacaa ggaaatcctc 22 
80 

gatgaagcct acgtgatggc cagcgtggac aacccccacg tgtgccgcct gctgggcatc 23 
40 

tgcctcacct ccaccgtgca actcatcacg cagctcatgc ccttcggctg cctcctggac 24 
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00 

tatgtccggg aacacaaaga caatattggc tcccagtacc tgctcaactg gtgtgtgcag 24 
60 

atcgcaaagg gcatgaacta cttggaggac cgtcgcttgg tgcaccgcga cctggcagcc 25 
20 

aggaacgtac tggtgaaaac accgcagcat gtcaagatca cagattttgg gctggccaaa 25 
80 

ctgctgggtg cggaagagaa agaataccat gcagaaggag gcaaagtgcc tatcaagtgg 26 
40 

atggcattgg aatcaatttt acacagaatc tatacccacc agagtgatgt ctggagctac 27 
00 

ggggtgaccg tttgggagtt gatgaccttt ggatccaagc catatgacgg aatccctgcc 27 
60 

agcgagatct cctccatcct ggagaaagga gaacgcctcc ctcagccacc catatgtacc 28 
20 

atcgatgtct acatgatcat ggtcaagtgc tggatgatag acgcagatag tcgcccaaag 2 8 
80 

ttccgtgagt tgatcatcga attctccaaa atggcccgag acccccagcg ctaccttgtc 2 9 
40 

attcaggggg atgaaagaat gcatttgcca agtcctacag actccaactt ctaccgtgcc 3 0 
00 

ctgatggatg aagaagacat ggacgacgtg gtggatgccg acgagtacct catcccacag 3 0 
60 

cagggcttct tcagcagccc ctccacgtca cggactcccc tcctgagctc tctgagtgca 31 
20 

accagcaaca attccaccgt ggcttgcatt gatagaaatg ggctgcaaag ctgtcccatc 31 
80 

aaggaagaca gcttcttgca gcgatacagc tcagacccca caggcgcctt gactgaggac 32 
40 

agcatagacg acaccttcct cccagtgcct gaatacataa accagtccgt tcccaaaagg 33 
00 

cccgctggct ctgtgcagaa tcctgtctat cacaatcagc ctctgaaccc cgcgcccagc 33 
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60 



agagacccac actaccagga cccccacagc actgcagtgg gcaaccccga gtatctcaac 34 
20 



actgtccagc ccacctgtgt caacagcaca ttcgacagcc ctgcccactg ggcccagaaa 34 
80 



ggcagccacc aaattagcct ggacaaccct gactaccagc aggacttctt tcccaaggaa 3 5 
40 



gccaagccaa atggcatctt taagggctcc acagctgaaa atgcagaata cctaagggtc 3 6 
00 



gcgccacaaa gcagtgaatt tattggagca tga 36 



33 




<210> 


2 


<211> 


1210 


<212> 


PRT 


<213> 


EGFR 


<400> 


2 



Met Arg Pro Ser Gly Thr 
1 5 

Ala Leu Cys Pro Ala Ser 
20 

Gly Thr Ser Asn Lys Leu 
35 

Leu Ser Leu Gin Arg Met 
50 

Leu Glu lie Thr Tyr Val 
65 70 

Thr He Gin Glu Val Ala 

85 

Glu Arg He Pro Leu Glu 
100 

Tyr Glu Asn Ser Tyr Ala 
115 



Ala Gly Ala Ala Leu 
10 

Arg Ala Leu Glu Glu 
25 

Thr Gin Leu Gly Thr 
40 

Phe Asn Asn Cys Glu 
55 

Gin Arg Asn Tyr Asp 

75 

Gly Tyr Val Leu He 
90 

Asn Leu Gin He He 
105 

Leu Ala Val Leu Ser 
120 



Leu Ala Leu Leu Ala 
15 

Lys Lys Val Cys Gin 
30 

Phe Glu Asp His Phe 
45 

Val Val Leu Gly Asn 
60 

Leu Ser Phe Leu Lys 

80 

Ala Leu Asn Thr Val 
95 

Arg Gly Asn Met Tyr 
110 

Asn Tyr Asp Ala Asn 
125 



Page 5 



31192prj , ST25.txt 



Lys Thr Gly Leu 
130 

His Gly Ala Val 
145 

Ser lie Gin Trp 



Ser Met Asp Phe 
180 

Ser Cys Pro Asn 
195 

Lys Leu Thr Lys 
210 

Gly Lys Ser Pro 
225 



Lys Glu Leu Pro 
135 

Arg Phe Ser Asn 
150 

Arg Asp lie Val 
165 

Gin Asn His Leu 



Gly Ser Cys Trp 
200 

lie lie Cys Ala 
215 

Ser Asp Cys Cys 
230 



Met Arg Asn Leu 
140 

Asn Pro Ala Leu 
155 

Ser Ser Asp Phe 
170 

Gly Ser Cys Gin 
185 

Gly Ala Gly Glu 



Gin Gin Cys Ser 
220 

His Asn Gin Cys 
235 



Gin Glu lie Leu 



Cys Asn Val Glu 
160 

Leu Ser Asn Met 
175 

Lys Cys Asp Pro 
190 

Glu Asn Cys Gin 
205 

Gly Arg Cys Arg 



Ala Ala Gly Cys 
240 



Thr Gly Pro Arg 



Glu Ala Thr Cys 
260 

Thr Thr Tyr Gin 
275 

Ala Thr Cys Val 
290 

Gly Ser Cys Val 
305 

Asp Gly Val Arg 



Cys Asn Gly lie 
340 

Ala Thr Asn lie 
355 

Leu His lie Leu 
370 



Glu Ser Asp Cys 
245 

Lys Asp Thr Cys 



Met Asp Val Asn 
280 

Lys Lys Cys Pro 
295 

Arg Ala Cys Gly 
310 

Lys Cys Lys Lys 
325 

Gly He Gly Glu 



Lys His Phe Lys 
360 

Pro Val Ala Phe 
375 



Leu Val Cys Arg 
250 

Pro Pro Leu Met 
265 

Pro Glu Gly Lys 



Arg Asn Tyr Val 
300 

Ala Asp Ser Tyr 
315 

Cys Glu Gly Pro 
330 

Phe Lys Asp Ser 
345 

Asn Cys Thr Ser 



Arg Gly Asp Ser 
380 



Lys Phe Arg Asp 
255 

Leu Tyr Asn Pro 
270 

Tyr Ser Phe Gly 
285 

Val Thr Asp His 



Glu Met Glu Glu 
320 

Cys Arg Lys Val 
335 

Leu Ser He Asn 
350 

He Ser Gly Asp 
365 

Phe Thr His Thr 



Page 6 



31192prj .ST25.txt 



Pro Pro Leu Asp Pro Gin Glu Leu Asp lie Leu Lys Thr Val Lys Glu 
385 390 395 400 

lie Thr Gly Phe Leu Leu lie Gin Ala Trp Pro Glu Asn Arg Thr Asp 

405 410 415 

Leu His Ala Phe Glu Asn Leu Glu lie lie Arg Gly Arg Thr Lys Gin 
420 425 430 

His Gly Gin Phe Ser Leu Ala Val Val Ser Leu Asn lie Thr Ser Leu 
435 440 445 

Gly Leu Arg Ser Leu Lys Glu lie Ser Asp Gly Asp Val lie lie Ser 
450 455 460 

Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr lie Asn Trp Lys Lys Leu 
465 470 475 480 

Phe Gly Thr Ser Gly Gin Lys Thr Lys lie lie Ser Asn Arg Gly Glu 

485 490 495 

Asn Ser Cys Lys Ala Thr Gly Gin Val Cys His Ala Leu Cys Ser Pro 
500 505 510 

Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
515 520 525 

Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Asn Leu Leu Glu Gly 
530 535 540 

Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys lie Gin Cys His Pro 
545 550 555 560 

Glu Cys Leu Pro Gin Ala Met Asn lie Thr Cys Thr Gly Arg Gly Pro 

565 570 575 

Asp Asn Cys lie Gin Cys Ala His Tyr lie Asp Gly Pro His Cys Val 
580 585 590 

Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
595 600 605 

Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
610 615 620 

Thr Tyr Gly Cys Thr Gly Pro Gly Leu Glu Gly Cys Pro Thr Asn Gly 
625 630 635 640 
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31192prj .ST25.txt 



Pro Lys lie Pro Ser lie Ala Thr Gly Met Val Gly Ala Leu Leu Leu 

645 650 655 

Leu Leu Val Val Ala Leu Gly lie Gly Leu Phe Met Arg Arg Arg His 
660 665 670 

lie Val Arg Lys Arg Thr Leu Arg Arg Leu Leu Gin Glu Arg Glu Leu 
675 680 685 

Val Glu Pro Leu Thr Pro Ser Gly Glu Ala Pro Asn Gin Ala Leu Leu 
690 695 700 

Arg lie Leu Lys Glu Thr Glu Phe Lys Lys lie Lys Val Leu Gly Ser 
705 710 715 720 

Gly Ala Phe Gly Thr Val Tyr Lys Gly Leu Trp He Pro Glu Gly Glu 

725 730 735 

Lys Val Lys He Pro Val Ala He Lys Glu Leu Arg Glu Ala Thr Ser 
740 745 750 

Pro Lys Ala Asn Lys Glu He Leu Asp Glu Ala Tyr Val Met Ala Ser 
755 760 765 

Val Asp Asn Pro His Val Cys Arg Leu Leu Gly He Cys Leu Thr Ser 
770 775 780 

Thr Val Gin Leu He Thr Gin Leu Met Pro Phe Gly Cys Leu Leu Asp 
785 790 795 800 

Tyr Val Arg Glu His Lys Asp Asn He Gly Ser Gin Tyr Leu Leu Asn 

805 810 815 

Trp Cys Val Gin He Ala Lys Gly Met Asn Tyr Leu Glu Asp Arg Arg 
820 825 830 

Leu Val His Arg Asp Leu Ala Ala Arg Asn Val Leu Val Lys Thr Pro 
835 840 845 

Gin His Val Lys He Thr Asp Phe Gly Leu Ala Lys Leu Leu Gly Ala 
850 855 860 

Glu Glu Lys Glu Tyr His Ala Glu Gly Gly Lys Val Pro He Lys Trp 
865 870 875 880 

Met Ala Leu Glu Ser He Leu His Arg He Tyr Thr His Gin Ser Asp 

885 890 895 
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Val Trp Ser Tyr Gly Val Thr Val Trp Glu Leu Met Thr Phe Gly Ser 
900 905 910 

Lys Pro Tyr Asp Gly lie Pro Ala Ser Glu lie Ser Ser lie Leu Glu 
915 920 925 

Lys Gly Glu Arg Leu Pro Gin Pro Pro lie Cys Thr lie Asp Val Tyr 
930 935 940 

Met lie Met Val Lys Cys Trp Met lie Asp Ala Asp Ser Arg Pro Lys 
945 950 955 960 

Phe Arg Glu Leu lie lie Glu Phe Ser Lys Met Ala Arg Asp Pro Gin 

965 970 975 

Arg Tyr Leu Val lie Gin Gly Asp Glu Arg Met His Leu Pro Ser Pro 
980 985 990 

Thr Asp Ser Asn Phe Tyr Arg Ala Leu Met Asp Glu Glu Asp Met Asp 
995 1000 1005 

Asp Val Val Asp Ala Asp Glu Tyr Leu lie Pro Gin Gin Gly Phe 
1010 1015 1020 

Phe Ser Ser Pro Ser Thr Ser Arg Thr Pro Leu Leu Ser Ser Leu 
1025 1030 1035 

Ser Ala Thr Ser Asn Asn Ser Thr Val Ala Cys lie Asp Arg Asn 
1040 1045 1050 

Gly Leu Gin Ser Cys Pro lie Lys Glu Asp Ser Phe Leu Gin Arg 
1055 1060 1065 

Tyr Ser Ser Asp Pro Thr Gly Ala Leu Thr Glu Asp Ser lie Asp 
1070 1075 1080 

Asp Thr Phe Leu Pro Val Pro Glu Tyr lie Asn Gin Ser Val Pro 
1085 1090 1095 

Lys Arg Pro Ala Gly Ser Val Gin Asn Pro Val Tyr His Asn Gin 
1100 1105 1110 

Pro Leu Asn Pro Ala Pro Ser Arg Asp Pro His Tyr Gin Asp Pro 
1115 1120 1125 

His Ser Thr Ala Val Gly Asn Pro Glu Tyr Leu Asn Thr Val Gin 
1130 1135 1140 
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Pro Thr Cys Val Asn Ser Thr Phe Asp Ser Pro Ala His Trp Ala 
1145 1150 1155 

Gin Lys Gly Ser His Gin lie Ser Leu Asp Asn Pro Asp Tyr Gin 
1160 1165 1170 

Gin Asp Phe Phe Pro Lys Glu Ala Lys Pro Asn Gly lie Phe Lys 
1175 1180 1185 

Gly Ser Thr Ala Glu Asn Ala Glu Tyr Leu Arg Val Ala Pro Gin 
1190 1195 1200 

Ser Ser Glu Phe lie Gly Ala 
1205 1210 

<210> 3 

<211> 31 

<212> DNA 

<213> primer 

<400> 3 

ctaggctagc atgcgaccct ccgggacggc c 
31 



<210> 4 

<211> 42 

<212> DNA 

<213> primer 

<400> 4 

ctctgcccgg cgactcgggc tgacagctat gagatggagg aa 
42 



<210> 5 

<211> 39 

<212> DNA 

<213> primer 

<400> 5 

ttcctccatc tcatagctag cccgactcgc cgggcagag 
39 



<210> 6 
<211> 31 
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<212> DNA 
<213> primer 
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<400> 6 

ggatatccta cgtgcgcttc cgaacgatgt g 
31 



<210> 7 

<211> 2607 

<212> DNA 

<213> MuSK 

<400> 7 

atgagagagc tcgtcaacat tccactggta catattctta ctctggttgc cttcagcgga 
60 

actgagaaac ttccaaaagc tcctgtcatc accactcctc ttgaaacagt ggatgcctta 1 
20 

gttgaagaag tggctacttt catgtgtgca gtggaatcct acccccagcc tgagatttcc 1 
80 

tggactagaa ataaaattct cattaaactc tttgacaccc ggtacagcat ccgggagaat 2 
40 

gggcagctcc tcaccatcct gagtgtggaa gacagtgatg atggcattta ctgctgcacg 3 
00 

gccaacaatg gtgtgggagg agctgtggag agttgtggag ccctgcaagt gaagatgaaa 3 
60 

cctaaaataa ctcgtcctcc cataaatgtg aaaataatag agggattaaa agcagtccta 4 
20 

ccatgtacta caatgggtaa tcccaaacca tcagtgtctt ggataaaggg agacagccct 4 
80 

ctcagggaaa attcccgaat tgcagttctt gaatctggga gcttgaggat tcataacgta 5 
40 

caaaaggaag atgcaggaca gtatcgatgt gtggcaaaaa acagcctcgg gacagcatat 6 
00 

tccaaagtgg tgaagctgga atttgaggtt tttgccagga tcctgcgggc tcctgaatcc 6 
60 

cacaatgtca cctttggctc ctttgtgacc ctgcactgta cagcaacagg cattcctgtc 7 
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20 

cccaccatca cctggattga aaacggaaat gctgtttctt ctgggtccat tcaagagagt 7 
80 

gtgaaagacc gagtgattga ctcaagactg cagctgttta tcaccaagcc aggactctac 8 
40 

acatgcatag ctaccaataa gcatggggag aagttcagta ctgccaaggc tgcagccacc 9 
00 

atcagcatag cagaatggag taaaccacag aaagataaca aaggctactg cgcccagtac 9 
60 

agaggggagg tgtgtaatgc agtcctggca aaagatgctc ttgtttttct caacacctcc 10 
20 

tatgcggacc ctgaggaggc ccaagagcta ctggtccaca cggcctggaa tgaactgaaa 10 
80 

gtagtgagcc cagtctgccg gccagctgct gaggctttgt tgtgtaacca catcttccag 11 
40 

gagtgcagtc ctggagtagt gcctactcct attcccattt gcagagagta ctgcttggca 12 
00 

gtaaaggagc tcttctgcgc aaaagaatgg ctggtaatgg aagagaagac ccacagagga 12 
60 

ctctacagat ccgagatgca tttgctgtcc gtgccaaaat gcagcaagct tcccagcatg 13 
20 

cattgggacc ccacggcctg tgccagactg ccacatctag attataacaa agaaaaccta 13 
80 

aaaacattcc caccaatgac gtcctcaaag ccaagtgtgg acattccaaa tctgccttcc 14 
40 

tcctcctctt cttccttctc tgtctcacct acatactcca tgactgtaat aatctccatc 15 
00 

atgtccagct ttgcaatatt tgtgcttctt accataacta ctctctattg ctgccgaaga 15 
60 

agaaaacaat ggaaaaataa gaaaagagaa tcagcagcag taaccctcac cacactgcct 16 
20 

tctgagctct tactagatag acttcatccc aaccccatgt accagaggat gccgctcctt 16 
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80 

ctgaacccca aattgctcag cctggagtat ccaaggaata acattgaata tgtgagagac 17 
40 

atcggagagg gagcgtttgg aagggtgttt caagcaaggg caccaggctt acttccctat 18 
00 

gaacctttca ctatggtggc agtaaagatg ctcaaagaag aagcctcggc agatatgcaa 18 
60 

gcggactttc agagggaggc agccctcatg gcagaatttg acaaccctaa cattgtgaag 19 
20 

ctattaggag tgtgtgctgt cgggaagcca atgtgcctgc tctttgaata catggcctat 19 
80 

ggtgacctca atgagttcct ccgcagcatg tcccctcaca ccgtgtgcag cctcagtcac 2 0 
40 

agtgacttgt ctatgagggc tcaggtctcc agccctgggc ccccacccct ctcctgtgct 21 
00 

gagcagcttt gcattgccag gcaggtggca gctggcatgg cttacctctc agaacgtaag 21 
60 

tttgttcacc gagatttagc caccaggaac tgcctggtgg gcgagaacat ggtggtgaaa 22 
20 

attgccgact ttggcctctc caggaacatc tactcagcag actactacaa agctaatgaa 22 
80 

aacgacgcta tccctatccg ttggatgcca ccagagtcca ttttttataa ccgctacact 23 
40 

acagagtctg atgtgtgggc ctatggcgtg gtcctctggg agatcttctc ctatggcctg 24 
00 

cagccctact atgggatggc ccatgaggag gtcatttact acgtgcgaga tggcaacatc 24 
60 

ctctcctgcc ctgagaactg ccccgtggag ctgtacaatc tcatgcgtct atgttggagc 2 5 
20 

aagctgcctg cagacagacc cagtttcacc agtattcacc gaattctgga acgcatgtgt 25 
80 

gagagggcag agggaactgt gagtgtc 26 
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<210> 8 

<211> 869 

<212> PRT 

<213> MuSK 

<400> 8 

Met Arg Glu Leu Val Asn lie Pro Leu Val His lie Leu Thr Leu Val 
15 10 15 

Ala Phe Ser Gly Thr Glu Lys Leu Pro Lys Ala Pro Val lie Thr Thr 
20 25 30 

Pro Leu Glu Thr Val Asp Ala Leu Val Glu Glu Val Ala Thr Phe Met 
35 40 45 

Cys Ala Val Glu Ser Tyr Pro Gin Pro Glu lie Ser Trp Thr Arg Asn 
50 55 60 

Lys lie Leu lie Lys Leu Phe Asp Thr Arg Tyr Ser lie Arg Glu Asn 
65 70 75 80 

Gly Gin Leu Leu Thr lie Leu Ser Val Glu Asp Ser Asp Asp Gly lie 

85 90 95 

Tyr Cys Cys Thr Ala Asn Asn Gly Val Gly Gly Ala Val Glu Ser Cys 
100 105 110 

Gly Ala Leu Gin Val Lys Met Lys Pro Lys lie Thr Arg Pro Pro lie 
115 120 125 

Asn Val Lys lie lie Glu Gly Leu Lys Ala Val Leu Pro Cys Thr Thr 
130 135 140 

Met Gly Asn Pro Lys Pro Ser Val Ser Trp lie Lys Gly Asp Ser Pro 
145 150 155 160 

Leu Arg Glu Asn Ser Arg lie Ala Val Leu Glu Ser Gly Ser Leu Arg 

165 170 175 

lie His Asn Val Gin Lys Glu Asp Ala Gly Gin Tyr Arg Cys Val Ala 
180 185 190 

Lys Asn Ser Leu Gly Thr Ala Tyr Ser Lys Val Val Lys Leu Glu Phe 
195 200 205 
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Glu Val Phe Ala 
210 

Phe Gly Ser Phe 
225 

Pro Thr lie Thr 



lie Gin Glu Ser 
260 

Phe lie Thr Lys 
275 

Gly Glu Lys Phe 
290 

Glu Trp Ser Lys 
305 

Arg Gly Glu Val 



Leu Asn Thr Ser 
340 

His Thr Ala Trp 
355 

Ala Ala Glu Ala 
370 

Gly Val Val Pro 
385 

Val Lys Glu Leu 



Thr His Arg Gly 
420 

Lys Cys Ser Lys 
435 

Arg Leu Pro His 
450 



31192 

Arg lie Leu Arg 
215 

Val Thr Leu His 
230 

Trp lie Glu Asn 
245 

Val Lys Asp Arg 



Pro Gly Leu Tyr 
280 

Ser Thr Ala Lys 
295 

Pro Gin Lys Asp 
310 

Cys Asn Ala Val 
325 

Tyr Ala Asp Pro 



Asn Glu Leu Lys 
360 

Leu Leu Cys Asn 
375 

Thr Pro lie Pro 
390 

Phe Cys Ala Lys 
405 

Leu Tyr Arg Ser 



Leu Pro Ser Met 
440 

Leu Asp Tyr Asn 
455 



prj .ST25 . txt 

Ala Pro Glu Ser 
220 

Cys Thr Ala Thr 
235 

Gly Asn Ala Val 
250 

Val lie Asp Ser 
265 

Thr Cys lie Ala 



Ala Ala Ala Thr 
300 

Asn Lys Gly Tyr 
315 

Leu Ala Lys Asp 
330 

Glu Glu Ala Gin 
345 

Val Val Ser Pro 



His He Phe Gin 
380 

lie Cys Arg Glu 
395 

Glu Trp Leu Val 
410 

Glu Met His Leu 
425 

His Trp Asp Pro 



Lys Glu Asn Leu 
460 



His Asn Val Thr 



Gly He Pro Val 
240 

Ser Ser Gly Ser 
255 

Arg Leu Gin Leu 
270 

Thr Asn Lys His 
285 

He Ser He Ala 



Cys Ala Gin Tyr 
320 

Ala Leu Val Phe 
335 

Glu Leu Leu Val 
350 

Val Cys Arg Pro 
365 

Glu Cys Ser Pro 



Tyr Cys Leu Ala 
400 

Met Glu Glu Lys 
415 

Leu Ser Val Pro 
430 

Thr Ala Cys Ala 
445 

Lys Thr Phe Pro 
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Pro Met Thr Ser Ser Lys Pro Ser Val Asp lie Pro Asn Leu Pro Ser 
465 470 475 480 

Ser Ser Ser Ser Ser Phe Ser Val Ser Pro Thr Tyr Ser Met Thr Val 

485 490 495 

He He Ser He Met Ser Ser Phe Ala He Phe Val Leu Leu Thr He 
500 505 510 

Thr Thr Leu Tyr Cys Cys Arg Arg Arg Lys Gin Trp Lys Asn Lys Lys 
515 520 525 

Arg Glu Ser Ala Ala Val Thr Leu Thr Thr Leu Pro Ser Glu Leu Leu 
530 535 540 

Leu Asp Arg Leu His Pro Asn Pro Met Tyr Gin Arg Met Pro Leu Leu 
545 550 555 560 

Leu Asn Pro Lys Leu Leu Ser Leu Glu Tyr Pro Arg Asn Asn He Glu 

565 570 575 

Tyr Val Arg Asp He Gly Glu Gly Ala Phe Gly Arg Val Phe Gin Ala 
580 585 590 

Arg Ala Pro Gly Leu Leu Pro Tyr Glu Pro Phe Thr Met Val Ala Val 
595 600 605 

Lys Met Leu Lys Glu Glu Ala Ser Ala Asp Met Gin Ala Asp Phe Gin 
610 615 620 

Arg Glu Ala Ala Leu Met Ala Glu Phe Asp Asn Pro Asn He Val Lys 
625 630 635 640 

Leu Leu Gly Val Cys Ala Val Gly Lys Pro Met Cys Leu Leu Phe Glu 

645 650 655 

Tyr Met Ala Tyr Gly Asp Leu Asn Glu Phe Leu Arg Ser Met Ser Pro 
660 665 670 

His Thr Val Cys Ser Leu Ser His Ser Asp^ Leu Ser Met Arg Ala Gin 
675 680 685 

Val Ser Ser Pro Gly Pro Pro Pro Leu Ser Cys Ala Glu Gin Leu Cys 
690 695 700 

He Ala Arg Gin Val Ala Ala Gly Met Ala Tyr Leu Ser Glu Arg Lys 
705 710 715 720 
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Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Asn 

725 730 735 

Met Val Val Lys lie Ala Asp Phe Gly Leu Ser Arg Asn lie Tyr Ser 
740 745 750 

Ala Asp Tyr Tyr Lys Ala Asn Glu Asn Asp Ala lie Pro lie Arg Trp 
755 760 765 

Met Pro Pro Glu Ser lie Phe Tyr Asn Arg Tyr Thr Thr Glu Ser Asp 
770 775 780 

Val Trp Ala Tyr Gly Val Val Leu Trp Glu He Phe Ser Tyr Gly Leu 
785 790 795 800 

Gin Pro Tyr Tyr Gly Met Ala His Glu Glu Val He Tyr Tyr Val Arg 

805 810 815 

Asp Gly Asn He Leu Ser Cys Pro Glu Asn Cys Pro Val Glu Leu Tyr 
820 825 830 

Asn Leu Met Arg Leu Cys Trp Ser Lys Leu Pro Ala Asp Arg Pro Ser 
835 840 845 

Phe Thr Ser He His Arg He Leu Glu Arg Met Cys Glu Arg Ala Glu 
850 855 860 

Gly Thr Val Ser Val 
865 

<210> 9 
<211> 25 
<212> DNA 
<213> primer 

<400> 9 

cgtcctgcgt gagcctggat taatc 
25 



<210> 10 

<211> 25 

<212> DNA 

<213> primer 

<400> 10 

gcctggatta atcatgagag agctc 
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<210> 11 

<211> 42 

<212> DNA 

<213> primer 

<400> 11 

cgaggcctgt cttcaacctt agacactcac agttccctct gc 
42 



<210> 12 

<211> 27 

<212> DNA 

<213> primer 

<400> 12 

cggcctgtgc cagactgcca catctag 
27 



<210> 13 

<211> 32 

<212> DNA 

<213> primer 

<400> 13 

cgtctaggtg agggttactg ctgctgattc tc 
32 



<210> 14 

<211> 38 

<212> DNA 

<213> primer 



<400> 14 

ggttaaccct attcaatgtt attccttgga tactccag 
38 



<210> 15 

<211> 24 

<212> DNA 

<213> primer 
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<400> 15 

cttccaaaag ctcctgtcat cacc 
24 



<210> 16 

<211> 26 

<212> DNA 

<213> primer 

<400> 16 

ccagtcatgg agtatgtagg tgagac 
26 
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